ELEMENTS OF 
TENSOR CALCULUS 


A. Lichnerowicz


METHUEN'S MONOGRAPHS ON 


PHYSICAL SUBJECTS 


Elements of Tensor Calculus
A. LICHNEROWICZ


A translation from the French of a work by
an internationally acknowledged authority on
the subject.

Part I contains a rigorous presentation of
tensor calculus as a natural development of
vector analysis. Part II provides an introduction
to the most important applications
of tensor calculus. It begins with the use of
tensors in classical analytical dynamics, and a
detailed exposition of the role of tensors in
special relativity theory is then given. The
book concludes with a brief presentation
of the field equations of general relativity.

The author is Professeur au College de France
and the translator is Senior Lecturer in Physics
in the University of London, at Queen Mary
College.
LONDON:
METHUEN & CO LTD
NEW YORK:
JOHN WILEY & SONS INC
METHUEN’S MONOGRAPHS 


ON PHYSICAL SUBJECTS 


General Editor: B. L. WORSNOP, B.Sc., Ph.D.


ELEMENTS OF TENSOR CALCULUS 


Elements of 
Tensor Calculus 


by A. Lichnerowicz 


Professeur au College de France 


translated by 


J. W. LEECH 


Senior Lecturer in Physics 
Queen Mary College, University of London 


and 


D. J. NEWMAN 


Lecturer in Physics 
Queen Mary College, University of London 


LONDON: METHUEN & CO LTD 
NEW YORK: JOHN WILEY & SONS INC 


First published in 1950 as 
‘Eléments de Calcul Tensoriel’ by 
Librairie Armand Colin, Paris 
© 1950 by Max Leclerc et Cie, Paris 
English translation of 4th revised
edition (1958) first published in 1962
© 1962 by Methuen & Co Ltd
Printed in Great Britain by
Spottiswoode Ballantyne & Co Ltd
London & Colchester


Catalogue No. (Methuen) 2/4313/11


Contents


Preface


PART I: TENSOR CALCULUS


I Vector Spaces

I Concept of a vector space, §§1-4. II n-dimensional
vector spaces, §§5-9. III Duality, §§10-13.
IV Euclidean vector spaces, §§14-24.


II Affine Euclidean Point Spaces, §§25-29


III Tensor Algebra

I Concept of a tensor product, §§30-32. II Affine
tensors, §§33-40. III Euclidean tensors, §§41-45.
IV Outer products, §§46-52.


IV Curvilinear Coordinates in Euclidean Space

I Derivatives and differentials of vectors and
points, §§53-55. II Curvilinear coordinates in a
Euclidean point space, §§56-59. III Christoffel
symbols, §§60-63. IV Absolute differentials and
covariant derivatives, §§64-66. V Differential
operators in curvilinear coordinates, §§67-70.


V Riemannian Spaces

I Tangential and osculating Euclidean metrics,
§§71-76. II The transport of Euclidean metrics,
§§77-79. III Curvature tensor of a Riemannian
space, §§80-86.


PART II: APPLICATIONS


VI Tensor Calculus and Classical Dynamics

I Dynamics of holonomic systems with time-independent
constraints, §§87-92. II Dynamics of
holonomic systems with time-dependent
constraints, §§93-96. III Dynamics of continuous
media, §§97-103.


VII Special Relativity and Maxwell’s Equations

I Physical principles, §§104-106. II The Lorentz
group and Minkowski space-time, §§107-111.
III Dynamics in special relativity, §§112-115.
IV Relativistic dynamics of continuous media,
§§116-118. V The Maxwell-Lorentz equations,
§§119-126.


VIII Elements of the Relativistic Theory of Gravitation,
§§127-131

Bibliography

Index

Preface 


In 1900 Ricci and Levi-Civita produced a celebrated mémoire which 
gave the first systematic account of tensor calculus and drew the 
attention of mathematicians and physicists to some of its applications. 
Since then much has happened. The appearance of the theory of rela- 
tivity, which would not have been possible without the previous 
existence of tensor calculus, gave it, in turn, an immense impetus. 
Tensor calculus has now become one of the essential techniques of 
modern theoretical physics. It has even been used recently in the 
study of technical problems such as the interconnection of electrical 
machines. It can be said that tensor calculus now forms a fundamen- 
tal part of mathematics and physics. 

This little book is divided into two parts, one concerned with tensor 
algebra and analysis, the other with the most important applications. 
In Part I the study of tensor algebra ends with a brief consideration of 
outer product algebras, since this technique deserves to be better 
known by physicists. On the other hand, the concept of tensor density, 
which is of little mathematical interest, is not introduced. This con- 
cept is, in fact, easily avoided by the introduction of adjoint tensors of 
antisymmetric tensors. 

In the chapters on tensor analysis I have confined myself to the 
analysis of tensor fields in Riemannian spaces, since Riemannian 
geometry is the most interesting from the point of view of applications. 
I have used systematically the method of ‘transported reference 
frames’ due to M. Elie Cartan. This method, which is the most geo- 
metrical and intuitive, has the added advantage of permitting the 
reader to avoid the consideration of other generalized geometries. 

In the section on applications I was forced to be selective. The first 
chapter is intended to show the intuitive nature of Riemannian spaces 
in classical analytical dynamics and their usefulness in this field. In 
particular, an introduction to the study of continuous media and
of elasticity is given. The reader wishing to extend his knowledge
in this direction should refer to the excellent works of M. Léon
Brillouin.

The remaining two chapters are devoted to the study of Maxwell’s 
equations of the electromagnetic field and to the theory of relativity. 
Only a brief sketch is given of the principles of general relativity 
theory. My task will have been accomplished if I have assisted the 
reader to undertake the study of the fundamental theories of contem- 
porary physics. 


Translators’ Note 


In the course of translation some explanatory footnotes have been 
added and a number of references to works not available in English 
have been omitted. The Bibliography has been revised and extended 
in order to provide suitable suggestions for further reading. 


Queen Mary College                J. W. L.
(University of London)            D. J. N.


PART I: TENSOR CALCULUS 


CHAPTER I 


Vector Spaces 


I. CONCEPT OF A VECTOR SPACE 


1. Definition of a vector space. Consider the set of displacement vectors 
of elementary vector analysis. These satisfy the following rules: 


(i) The result of vector addition of any two vectors, x and y, is their 
vector sum, or resultant, x+y. Vector addition has the following 
properties: 


(a) x + y = y + x (commutative property);

(b) x + (y + z) = (x + y) + z (associative property);

(c) there exists a zero vector denoted by 0 such that x + 0 = x;

(d) for every vector x there is a corresponding negative vector (−x),
such that x + (−x) = 0.


(ii) The result of multiplying a vector x by a real scalar α is a vector
denoted by αx. Scalar multiplication has the following properties:


(a′) 1x = x;

(b′) α(βx) = (αβ)x (associative property);

(c′) (α + β)x = αx + βx (distributive property for scalar addition);

(d′) α(x + y) = αx + αy (distributive property for vector addition).


Using the above properties as a guide, we now consider a general
set E of arbitrary elements x, y etc., which obey the following rules:

(1) To every pair x, y, there corresponds an element x + y having the
properties (a), (b), (c), (d).

(2) To every combination of an element x and a real number α there
corresponds an element αx having the properties (a′), (b′), (c′), (d′).

We then say that E is a vector space over the field of real numbers and
that the elements x, y, etc., are vectors in E. If the second rule holds for
all complex numbers α then E is a vector space over the field of complex
numbers. Except when otherwise stated we shall confine ourselves
in this book to the study of vector spaces over the field of real
numbers.


2. Examples of vector spaces. There are several other simple examples 
of vector spaces which may be quoted to give an idea of the interest 
and application of the general concept. 

(a) Consider the set of complex numbers a + ib, where a and b are
real. The addition of any two complex numbers (a + ib, c + id, etc.)
and the multiplication of a complex number by a real number α
obviously have the properties listed in §1. It follows that the set of
complex numbers constitutes a vector space over the field of real
numbers.


(b) Let X be an array of n real numbers arranged in definite order

$$X = (x_1, x_2, \ldots, x_n)$$

and let E be the set of all arrays X. Assume the following two rules of
composition:

If $X = (x_1, x_2, \ldots, x_n)$ and $Y = (y_1, y_2, \ldots, y_n)$ then

$$X + Y = (x_1 + y_1, x_2 + y_2, \ldots, x_n + y_n).$$

If $X = (x_1, x_2, \ldots, x_n)$ and if α is any real number then

$$\alpha X = (\alpha x_1, \alpha x_2, \ldots, \alpha x_n).$$


It is easily verified that these two rules imply the rules (1) and (2) of
§1. It then follows that E constitutes a vector space with respect to the
field of real numbers.

(c) Consider the set of real functions of a real variable defined on
the interval (0, 1) with the usual composition rules for the sum of two
functions and for the product of a function by a constant α. With
these rules the set under consideration is a vector space over the field
of real numbers.
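
As an informal illustration of example (b), the following Python sketch (not part of the original text) represents arrays of n numbers as tuples and tests the rules of §1 on a small sample. The names add, scale and satisfies_rules are hypothetical, and exact rational arithmetic is assumed so that equalities can be tested without rounding error. A test on finitely many samples is only a plausibility check, not a proof.

```python
from fractions import Fraction
from itertools import product


def add(x, y):
    """Sum of two arrays: add corresponding entries."""
    return tuple(a + b for a, b in zip(x, y))


def scale(alpha, x):
    """Product of the array x by the number alpha."""
    return tuple(alpha * a for a in x)


def satisfies_rules(vectors, scalars):
    """Test properties (a)-(d) and (a')-(d') of section 1 on finite samples."""
    zero = (Fraction(0),) * len(vectors[0])
    for x, y, z in product(vectors, repeat=3):
        if add(x, y) != add(y, x):                       # (a)
            return False
        if add(x, add(y, z)) != add(add(x, y), z):       # (b)
            return False
    for x in vectors:
        if add(x, zero) != x:                            # (c)
            return False
        if add(x, scale(-1, x)) != zero:                 # (d)
            return False
        if scale(1, x) != x:                             # (a')
            return False
    for alpha, beta in product(scalars, repeat=2):
        for x, y in product(vectors, repeat=2):
            if scale(alpha, scale(beta, x)) != scale(alpha * beta, x):
                return False                             # (b')
            if scale(alpha + beta, x) != add(scale(alpha, x), scale(beta, x)):
                return False                             # (c')
            if scale(alpha, add(x, y)) != add(scale(alpha, x), scale(alpha, y)):
                return False                             # (d')
    return True


# Illustrative sample data (arbitrary choices, not taken from the text).
sample_vectors = [
    (Fraction(1), Fraction(2), Fraction(3)),
    (Fraction(-1, 2), Fraction(0), Fraction(5)),
    (Fraction(7, 3), Fraction(-4), Fraction(1, 9)),
]
sample_scalars = [Fraction(0), Fraction(2), Fraction(-3, 4)]
print(satisfies_rules(sample_vectors, sample_scalars))
```

Rational numbers are used here only because floating-point addition is not exactly associative, which would make the equality tests unreliable.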


3. Elementary properties of vector spaces. (1) For any two vectors x
and y there is one, and only one, vector z such that

x = z + y.   (3.1)

This is easily seen by adding the vector (−y) to each side of (3.1)
giving the relation

z = x + (−y)

which defines z uniquely. As in elementary algebra we write

x + (−y) = x − y.

With this notation the property (c′) of §1 can be written as

(α − β)x = αx − βx.   (3.2)

In view of this property it follows that

(α − β)x + βx = (α − β + β)x = (α + 0)x = αx.

Putting α = β it can immediately be deduced from (3.2) that

0x = 0,   (3.3)

and, on writing α = 0,

(−β)x = −βx.

In particular

(−1)x = −x.   (3.4)


(2) From (3.4) it follows that the property (d′) of §1 can be rewritten
in the form

α(x − y) = αx − αy.   (3.5)

Putting x = y in (3.5) it follows that

α0 = 0.   (3.6)


(3) Conversely, the relation

αx = 0   (3.7)

implies that either α = 0 or x = 0. For, if α is not zero it has an inverse
α⁻¹ and on multiplying both sides of (3.7) by α⁻¹ we have

α⁻¹(αx) = 0,

or   (α⁻¹α)x = x = 0,

which is the required result.


4. Vector sub-spaces. Definition: A sub-space of a vector space E is
any part, V, of E which is such that, if x and y belong to V and α is any
real number, then the vectors x + y and αx also belong to V.

The commutative, associative and distributive properties of E 
clearly apply to V. The real number α may be zero so that it is clear
that V necessarily contains the zero vector. Again if x belongs to V, 
(—1)x = —x also belongs to V and it follows that each vector of V 
has a negative in V. The rules of addition and of multiplication by a 
scalar thus have the properties listed in §1, therefore V itself is a 
vector space. 

Some simple examples of vector sub-spaces may be given. 

(a) The set of vectors coplanar with two given vectors constitutes 
a sub-space of the vector space of elementary geometry. 

(b) If x is a non-zero vector of a vector space E, the set of products
αx, where α is any real number, constitutes a sub-space of E.

(c) The set of real functions of a real variable defined on the interval
(0, 1) forms a vector space over the field of real numbers. The bounded 
functions of a real variable defined in the same way form a sub-space 
of this vector space since the sum of two bounded functions and the 
product of a bounded function by a constant are themselves bounded. 


II. n-DIMENSIONAL VECTOR SPACES


5. Basis of a vector space. Let $x_1, x_2, \ldots, x_p$ be p non-zero vectors in a
vector space E. These vectors are said to form a linearly independent
system of order p if it is impossible to find p numbers
$\alpha_1, \alpha_2, \ldots, \alpha_p$, not all zero, such that

$$\alpha_1 x_1 + \alpha_2 x_2 + \ldots + \alpha_p x_p = 0.$$

In the contrary case the given system of p vectors is said to be
linearly dependent.

Consider the set of all systems of linearly independent vectors in
the vector space E. There are two possibilities: either (a) there exist
linearly independent systems of arbitrarily large order, or (b) the order
of the linearly independent systems is bounded.

In the second case the vector space is said to have a finite number of
dimensions. This classification will be explained shortly. In the
remainder of this book we shall only consider vector spaces which
have a finite number of dimensions. In this case it is possible to determine
an integer n such that there exist linearly independent systems
of order n but not of order (n+1). If $(e_1, e_2, \ldots, e_n)$ is any such system
of order n it will be called a basis of E in conformity with the following
definition:

A basis of a vector space E is any linearly independent system of
maximum order.

Let x be any vector in E. The system of (n+1) vectors
$(x, e_1, e_2, \ldots, e_n)$ is necessarily linearly dependent, so there exist (n+1)
numbers $\lambda, \alpha_1, \alpha_2, \ldots, \alpha_n$, not all zero, such that

$$\lambda x + \alpha_1 e_1 + \alpha_2 e_2 + \ldots + \alpha_n e_n = 0. \tag{5.1}$$

λ must be different from zero, otherwise the system $e_i$ will not be
linearly independent. Equation (5.1) can thus be solved for x and
there exist n numbers $x^1, x^2, \ldots, x^n$ such that

$$x = x^1 e_1 + x^2 e_2 + \ldots + x^n e_n \tag{5.2}$$

and the vector x is expressible as a linear combination of the $e_i$. Moreover,
this combination is unique for, if there existed another, their
difference would constitute a linear combination of the $e_i$ equal to the
null vector and with coefficients not all zero. This conflicts with the
original postulates. This result may be stated formally:

THEOREM: Given a basis of E, any vector x of E can be represented
in a unique way as a linear combination of the vectors of this basis.

The numbers $(x^1, x^2, \ldots, x^n)$ which appear in (5.2) are called the
components of x with respect to the basis $(e_1, e_2, \ldots, e_n)$.

It is easy to show that the property stated in the above theorem
specifies bases amongst all systems of vectors. Let $(e_1, e_2, \ldots, e_p)$ be a
system of p vectors such that any vector x of E can be expressed
uniquely in the form

$$x = x^1 e_1 + x^2 e_2 + \ldots + x^p e_p. \tag{5.3}$$

In particular the null vector 0 can be expressed in one way only
$(x^1 = x^2 = \ldots = x^p = 0)$ as a linear combination of the vectors of the
system. It follows that such a system must be linearly independent
and that $p \le n$.

It is clear that the same property holds for all linearly independent
systems of order p. Let $(\epsilon_1, \epsilon_2, \ldots, \epsilon_p)$ be such a system. $\epsilon_1$ can be
written

$$\epsilon_1 = \alpha_1 e_1 + \alpha_2 e_2 + \ldots + \alpha_p e_p.$$

Suppose that $\alpha_1$ (for example) is non-zero, then

$$e_1 = \beta_1 \epsilon_1 + \beta_2 e_2 + \ldots + \beta_p e_p,$$

and on substituting in (5.3) it is seen that any vector x of E can be
expressed as a linear combination of $(\epsilon_1, e_2, \ldots, e_p)$. In particular

$$\epsilon_2 = \gamma_1 \epsilon_1 + \gamma_2 e_2 + \ldots + \gamma_p e_p,$$

with at least one of the numbers $\gamma_2, \ldots, \gamma_p$ different from zero, otherwise
the system of $\epsilon_i$'s cannot be linearly independent. Repeating this
procedure we find that any vector x of E can be expressed in the form

$$x = \xi^1 \epsilon_1 + \xi^2 \epsilon_2 + \ldots + \xi^p \epsilon_p.$$

It follows immediately that there cannot exist any linearly independent
systems of order (p+1), consequently p = n. We now
express this formally:

THEOREM: For a system of vectors to constitute a basis of E it is
necessary and sufficient that any vector of E can be expressed in one,
and only one, way as a linear combination of the vectors of that
system.

The number n is termed the dimension of the vector space under
consideration. We shall, in future, use $E_n$ to denote an n-dimensional
vector space.


6. Examples. (a) In the vector space of elementary geometry a basis 
is formed by any three non-coplanar vectors. This space is therefore 
three dimensional. 


(b) Take the example (b) of §2 and consider the vectors

$$e_1 = (1, 0, \ldots, 0), \quad e_2 = (0, 1, \ldots, 0), \quad \ldots, \quad e_n = (0, 0, \ldots, 1). \tag{6.1}$$

Any vector

$$x = (x^1, x^2, \ldots, x^n)$$

can be expressed in one, and only one, way as a linear combination of
the $e_i$:

$$x = x^1 e_1 + x^2 e_2 + \ldots + x^n e_n.$$

It follows that these vectors constitute a basis for E, which is thus an
n-dimensional space.
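
For a basis other than (6.1) the components of a vector are less obvious, but the theorem of §5 guarantees that they exist and are unique. The sketch below (an illustration, not part of the original text) finds them by Gauss-Jordan elimination, a standard technique that the text does not discuss. The function name components and the sample vectors are hypothetical. The function returns None when the given vectors are linearly dependent and so do not form a basis.

```python
from fractions import Fraction


def components(basis, x):
    """Return [x^1, ..., x^n] with x = x^1 e_1 + ... + x^n e_n, or None.

    `basis` is a list of n arrays of length n. None is returned if the
    arrays are linearly dependent, in which case they are not a basis.
    """
    n = len(x)
    # Augmented matrix: row r holds the r-th entry of each basis vector,
    # followed by the r-th entry of x.
    m = [[Fraction(basis[j][r]) for j in range(n)] + [Fraction(x[r])]
         for r in range(n)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return None                      # no pivot: dependent system
        m[col], m[pivot] = m[pivot], m[col]
        p = m[col][col]
        m[col] = [v / p for v in m[col]]     # scale the pivot row to 1
        for r in range(n):
            if r != col and m[r][col] != 0:
                f = m[r][col]
                m[r] = [a - f * b for a, b in zip(m[r], m[col])]
    return [m[r][n] for r in range(n)]


# Illustrative data: three non-coplanar vectors of ordinary space.
basis = [(1, 1, 0), (0, 1, 1), (1, 0, 1)]
print(components(basis, (2, 3, 5)))
# Three coplanar vectors do not form a basis:
print(components([(1, 0, 0), (0, 1, 0), (1, 1, 0)], (1, 2, 3)))
```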


7. Vector sub-spaces of $E_n$. Let V be a vector sub-space of $E_n$. Then V
is a vector space in which any linearly independent system of vectors
is also a linearly independent system of $E_n$. It follows that V has a
finite number of dimensions $r \le n$. Let $(\epsilon_1, \epsilon_2, \ldots, \epsilon_r)$ be a basis of
V. Any vector of V can be put in one, and only one, way into the form

$$\xi^1 \epsilon_1 + \xi^2 \epsilon_2 + \ldots + \xi^r \epsilon_r. \tag{7.1}$$

Whatever the numbers $\xi^1, \xi^2, \ldots, \xi^r$, a vector in the form of (7.1)
belongs to V.

Conversely, if $(\epsilon_1, \epsilon_2, \ldots, \epsilon_r)$ is a linearly independent system of
order r in $E_n$ it is clear that the set of all vectors which can be expressed
as a linear combination of the $\epsilon_i$ constitutes a vector sub-space of $E_n$.

Any n-dimensional vector sub-space of $E_n$ coincides with $E_n$.


8. Complementary vector sub-spaces. Let us consider a linearly
independent system $(\epsilon_1, \epsilon_2, \ldots, \epsilon_r)$ of order r < n. We wish to complete
this system by adding (n−r) new vectors $(\eta_{r+1}, \ldots, \eta_n)$ such that the
system of vectors $\epsilon_i$ and $\eta_j$ constitutes a basis of $E_n$.

There certainly exists in $E_n$ at least one vector such that the system
formed by adjoining it to the $\epsilon_i$ is a linearly independent system; for if
not, the system $\epsilon_i$ would itself be a basis of $E_n$. Let $\eta_{r+1}$ be this vector
so that $(\epsilon_1, \epsilon_2, \ldots, \epsilon_r, \eta_{r+1})$ is a linearly independent system of order
(r+1). Repeating this procedure gives systems of increasing order and
it will stop only when the order reaches n, the dimensionality of $E_n$.
We now express this formally:

THEOREM: Given a linearly independent system of order r it is always
possible to complete the system with (n−r) vectors to obtain a basis for
$E_n$.

Let $U_r$ be a vector sub-space of $E_n$ having r < n dimensions and let
$(\epsilon_1, \epsilon_2, \ldots, \epsilon_r)$ be any basis of $U_r$. Using the preceding theorem (n−r)
vectors $(\eta_{r+1}, \ldots, \eta_n)$ can be found (in point of fact in an infinite
number of ways) such that the $\epsilon_i$ and $\eta_j$ define a basis of $E_n$. Let the
vector sub-space generated by the $\eta_j$ be denoted by $V_{n-r}$.

Evidently the two sub-spaces $U_r$ and $V_{n-r}$ have only the zero vector
in common. Moreover any vector x of $E_n$, which is expressible in the
form

$$x = \sum_{i=1}^{r} \alpha^i \epsilon_i + \sum_{j=r+1}^{n} \beta^j \eta_j,$$

can be decomposed into the sum

$$x = y + z$$

of a vector y of $U_r$ and a vector z of $V_{n-r}$. This decomposition is
evidently unique according to the preceding remarks. In view of this
property the vector sub-spaces $U_r$ and $V_{n-r}$ are said to be complementary.
This may be expressed formally:

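THEOREM: To every vector sub-space $U_r$ of $E_n$ there corresponds a
complementary sub-space $V_{n-r}$. The decomposition of a vector with
respect to a given pair $U_r$, $V_{n-r}$ is unique, although $V_{n-r}$ itself
depends on the choice of the $\eta_j$.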
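
The completion procedure described in this section can be sketched in Python as follows. This is an illustration under stated assumptions rather than the author's method: the candidate vectors are drawn from the arrays (6.1), linear independence is tested by computing a rank with Gaussian elimination, and the names rank and complete_to_basis are hypothetical. The input system is assumed to be linearly independent already.

```python
from fractions import Fraction


def rank(vectors):
    """Number of linearly independent vectors, by Gaussian elimination."""
    rows = [[Fraction(v) for v in vec] for vec in vectors]
    r = 0
    ncols = len(rows[0]) if rows else 0
    for col in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0),
                     None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            f = rows[i][col] / rows[r][col]
            rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


def complete_to_basis(system, n):
    """Adjoin vectors to a linearly independent system until it has order n.

    Each candidate is kept only if the enlarged system is still linearly
    independent, which mirrors the step-by-step argument of section 8.
    """
    basis = [tuple(v) for v in system]
    for k in range(n):
        candidate = tuple(1 if i == k else 0 for i in range(n))
        if rank(basis + [candidate]) == len(basis) + 1:
            basis.append(candidate)
    return basis


print(complete_to_basis([(1, 1, 0)], 3))
```

A different list of candidates would in general give a different completion, in agreement with the remark that the adjoined vectors can be chosen in an infinite number of ways.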


9. Change of basis. It follows from the first theorem of §8 that a
vector space $E_n$ has an infinity of bases. We propose to determine the
relations that exist between the components of a particular vector x
with respect to two distinct bases.

Let $(e_1, e_2, \ldots, e_n)$ and $(e_{1'}, e_{2'}, \ldots, e_{n'})$† be two arbitrary bases of
$E_n$. Expressing the vectors of each basis in terms of those of the other
we write

$$e_{i'} = \sum_{j=1}^{n} A^{j}_{i'} e_j, \qquad e_i = \sum_{j'=1}^{n} A^{j'}_{i} e_{j'}. \tag{9.1}$$


† It is common practice to distinguish bases (and their associated
coefficients) by dashes or bars attached to the symbols themselves rather than
to the indices. This is simpler to write, but can easily lead to confusion, since
the two indices attached to the transformation coefficients refer to different
bases. (T.)

Let x be an arbitrary vector in $E_n$ having components $x^i$ relative to
the basis $(e_i)$ and $x^{i'}$ relative to $(e_{i'})$. We have

$$x = \sum_{i=1}^{n} x^i e_i = \sum_{j'=1}^{n} x^{j'} e_{j'} = \sum_{i,j'} x^{j'} A^{i}_{j'} e_i. \tag{9.2}$$

Equating the coefficients of the $e_i$ appearing in the second and last
terms of (9.2) gives the transformation formula

$$x^i = \sum_{j'=1}^{n} A^{i}_{j'} x^{j'}. \tag{9.3}$$

On interchanging the roles played by the two bases we can also write

$$x^{i'} = \sum_{j=1}^{n} A^{i'}_{j} x^{j}. \tag{9.4}$$
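
A small numerical check of (9.1), (9.3) and (9.4) may help to fix the index conventions. The following Python sketch is an illustration only: the two-dimensional bases, the coefficient array A and the helper names combine, mat_vec and inverse_2x2 are all assumptions introduced here, and the inverse is computed with the elementary formula for a 2 × 2 array.

```python
from fractions import Fraction


def combine(coeffs, vectors):
    """Linear combination: sum of coeffs[i] * vectors[i]."""
    n = len(vectors[0])
    return tuple(sum(c * v[k] for c, v in zip(coeffs, vectors))
                 for k in range(n))


def mat_vec(a, v):
    """Apply the array a to the list of components v."""
    return [sum(a[i][j] * v[j] for j in range(len(v))) for i in range(len(a))]


def inverse_2x2(a):
    """Inverse of a 2 x 2 array, valid only when its determinant is non-zero."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    if det == 0:
        raise ValueError("the new vectors do not form a basis")
    return [[Fraction(a[1][1], det), Fraction(-a[0][1], det)],
            [Fraction(-a[1][0], det), Fraction(a[0][0], det)]]


# Old basis e_1, e_2, written in Cartesian components (illustrative choice).
old_basis = [(2, 0), (1, 1)]
# A[j][i] holds A^j_{i'}: column i lists the components of e_{i'} on the
# old basis, as in the first relation of (9.1).
A = [[1, 1],
     [1, 2]]
new_basis = [combine([A[0][i], A[1][i]], old_basis) for i in range(2)]
A_inv = inverse_2x2(A)            # holds the coefficients A^{i'}_j

x_new = [3, -1]                   # components x^{i'} on the new basis
x_old = mat_vec(A, x_new)         # formula (9.3)
x_back = mat_vec(A_inv, x_old)    # formula (9.4)

print(x_old, x_back)
print(combine(x_old, old_basis), combine(x_new, new_basis))
```

Both linear combinations in the last line describe the same vector, which is the content of (9.2).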


III. DUALITY 


10. Linear forms. Let $E_n$ denote a vector space of n dimensions and
suppose that to each vector x of $E_n$ there corresponds a quantity F(x),
the relationship being such that for any x, y of $E_n$ and for any real α
we have

F(x + y) = F(x) + F(y),   (10.1)

F(αx) = αF(x).   (10.2)

F(x) is then said to be a linear form defined on $E_n$ and (10.1), (10.2) are
the relations which characterize such forms.

With the help of (10.1) and (10.2) it is easy to obtain an expression
for F(x) in terms of the components of the vector x with respect to a
basis $(e_i)$. Representing an arbitrary vector of $E_n$ by

$$x = \sum_{i=1}^{n} x^i e_i$$

it follows, using (10.1) and (10.2) successively, that

$$F(x) = \sum_{i=1}^{n} x^i F(e_i), \tag{10.3}$$

where the $F(e_i)$ are independent of x. This may also be written

$$F(x) = \sum_{i=1}^{n} a_i x^i \qquad [a_i = F(e_i)].$$

11. Dual spaces. Consider the set of linear forms defined on $E_n$ and
denote its members by y*, z*, etc. Assume in addition the following
two rules of composition:

(1) If $y^*(x) = \sum_{i=1}^{n} y^*_i x^i$ and $z^*(x) = \sum_{i=1}^{n} z^*_i x^i$ denote two arbitrary
linear forms, then

$$y^*(x) + z^*(x) = \sum_{i=1}^{n} (y^*_i + z^*_i) x^i.$$

(2) If α denotes any real number, then

$$\alpha y^*(x) = \sum_{i=1}^{n} (\alpha y^*_i) x^i.$$

It is clear that these rules are in accordance with the requirements of
§1. Hence the set of linear forms defined on $E_n$ constitutes a vector
space. Furthermore every linear form can be expressed in one, and
only one, way as a linear combination of the n forms $(x^1, x^2, \ldots, x^n)$.
This system of n forms thus constitutes a basis for the vector space
under consideration which is therefore n-dimensional.

The vector space ($E_n^*$) of the linear forms defined on $E_n$, and conforming
to (1) and (2) above, is called the vector space dual to $E_n$.


12. Dual of the basis. Corresponding to each basis $(e_1, e_2, \ldots, e_n)$ of $E_n$
we have set up in a canonical manner a basis $(x^1, x^2, \ldots, x^n)$ of $E_n^*$.
This will be called the dual of the basis $(e_i)$.† If a change of basis is
effected on $E_n$:

$$e_{j'} = \sum_{i=1}^{n} A^{i}_{j'} e_i, \qquad e_i = \sum_{j'=1}^{n} A^{j'}_{i} e_{j'}, \tag{12.1}$$


† The dual basis arrived at in the manner indicated obviously depends
upon the choice of the vector x. The vector space $E_n^*$ dual to $E_n$ is, however,
uniquely defined by $E_n$. (T.)

a corresponding change of basis defined by (9.3), (9.4) is simultaneously
effected on $E_n^*$ such that

$$x^i = \sum_{j'=1}^{n} A^{i}_{j'} x^{j'}, \qquad x^{j'} = \sum_{i=1}^{n} A^{j'}_{i} x^{i}. \tag{12.2}$$

Under this change of basis the components $y^*_i$ of the form y*(x)
transform according to the relations

$$y^*_{j'} = \sum_{i=1}^{n} A^{i}_{j'} y^*_i, \qquad y^*_i = \sum_{j'=1}^{n} A^{j'}_{i} y^*_{j'}. \tag{12.3}$$

It is clear that each basis of $E_n^*$ can be considered as the dual of a
basis in $E_n$. Let $(y^{*1'}, y^{*2'}, \ldots, y^{*n'})$ be an arbitrary basis in $E_n^*$. The
forms $(x^1, x^2, \ldots, x^n)$ can be related to this basis as follows:

$$x^i = \sum_{j'=1}^{n} A^{i}_{j'} y^{*j'}.$$

The system of n vectors $e_{j'}$ defined by

$$e_{j'} = \sum_{i=1}^{n} A^{i}_{j'} e_i$$

is linearly independent and consequently forms an alternative basis
to $e_i$ in $E_n$ for which the dual basis in $E_n^*$ is $(y^{*1'}, y^{*2'}, \ldots, y^{*n'})$.


13. Biduality. Assuming the existence of the vector spaces $E_n$ and
$E_n^*$ referred to the dual bases $(e_i)$ and $(y^{*i} = x^i)$, let us consider the
space $E_n^{**}$ dual to $E_n^*$. Designate by $z^{**}_i$ the elements of the basis in
$E_n^{**}$ which is dual to the basis $(y^{*i})$ in $E_n^*$. Since the basis $(y^{*i})$ is dual
to $(e_i)$ and $(z^{**}_i)$ is dual to $(y^{*i})$, it follows from (12.1) and (12.2) that
the $e_i$ and the $z^{**}_i$ transform according to the same rule. As a consequence,
if each vector x of $E_n$ given by

$$x = \sum_{i=1}^{n} x^i e_i \tag{13.1}$$

is made to correspond to the element z** of $E_n^{**}$ given by

$$z^{**} = \sum_{i=1}^{n} x^i z^{**}_i, \tag{13.2}$$

then this correspondence is independent of the choice of basis. It is also
manifestly linear, in the sense that addition and scalar multiplication
remain invariant: to the sum of two elements in $E_n$ there corresponds
the sum of two elements in $E_n^{**}$ and to the product of an element in
$E_n$ by α there corresponds the product of an element in $E_n^{**}$ by α.

Since there is no intrinsic interest in the elements of $E_n^{**}$ we find it
convenient in what follows to identify the vector space $E_n^{**}$ with $E_n$ and
to consider as identical the elements x and z** which correspond
according to (13.1) and (13.2). In particular, the basis $(z^{**}_i)$ is identified
with the basis $(e_i)$ so that the dual basis relation becomes a
reciprocal one.†


† An example drawn from elementary vector analysis may help to clarify
the concept of duality.
Let us write

F(x) = F·x,

where F is an arbitrary vector. Then the function F(x) clearly satisfies (10.1)
and (10.2) and is therefore a linear form defined on 3-dimensional Euclidean
space ($E_3$). Introducing the components of F and x we have

$$x = x_1 i + x_2 j + x_3 k \quad \text{and} \quad F = F_1 i + F_2 j + F_3 k,$$

whence

$$F(x) = x_1 F_1 + x_2 F_2 + x_3 F_3.$$

The set of all functions F defines the space, $E_3^*$, dual to $E_3$. The elements
of $E_3^*$ may alternatively be represented as y*, z*, etc., where

$$y^*(x) = y^* \cdot x = x_1 y^*_1 + x_2 y^*_2 + x_3 y^*_3.$$

These functions y*, z*, etc., obviously satisfy (1) and (2) of §11 and therefore
$E_3^*$ is a vector space with the basis $(x_1, x_2, x_3)$.

When we go on to define bidual spaces it is clear that the special rule for
the formation of elements introduced above cannot be used precisely as
before since y* is not a vector in the normal restricted sense of the term.
(It is, of course, an element of $E_3^*$ and therefore a vector according to our
general definition.) The elements of the dual space, $E_3^{**}$, of $E_3^*$ may, however,
be defined by

$$z^{**}(y^*) = y^*_1 z^{**}_1 + y^*_2 z^{**}_2 + y^*_3 z^{**}_3$$

and if we further specify that

$$z^{**}_1 = i, \text{ etc.},$$

then the basis of $E_3$ is bidual to itself and $E_3^{**}$ is identical with $E_3$. (T.)


IV. EUCLIDEAN VECTOR SPACES


14. The summation convention. It is apparent in all the formulae 
given so far that summations are always effected with respect to an 
index repeated twice, once as a superscript and once as a subscript. 
For ease of notation Einstein proposed that in these circumstances 
the summation sign should be suppressed and the following conven- 
tion adopted: 

The Einstein summation convention: Whenever, in the same term, 
the same index appears twice, once as a superscript and once as a 
subscript, a summation is implied over all terms obtained by letting that 
index assume all its possible values, unless an explicit statement is 
made to the contrary. 

With this convention formulae (9.3) and (9.4) take the form

$$x^i = A^{i}_{j'} x^{j'}, \qquad x^{i'} = A^{i'}_{j} x^{j}, \tag{14.1}$$

whilst formulae (12.3) can be written

$$y^*_{j'} = A^{i}_{j'} y^*_i, \qquad y^*_i = A^{j'}_{i} y^*_{j'}. \tag{14.2}$$


The reader will easily convince himself that, far from making 
algebraic manipulations more difficult to follow, such a convention 
greatly simplifies the presentation and understanding of the formulae 
of tensor calculus once some familiarity has been established. 
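
In a program each implied summation becomes an explicit loop over the repeated index. The Python sketch below is offered as an illustration only: the array A, the sample components and the function names are assumptions made here. It evaluates the first relations of (14.1) and (14.2) and checks that the value $y^*_i x^i$ of a linear form is the same in both bases.

```python
def contract(y_lower, x_upper):
    """y_i x^i: summation over the repeated index i."""
    return sum(y * x for y, x in zip(y_lower, x_upper))


def transform_upper(A, x_new):
    """x^i = A^i_{j'} x^{j'}: summation over the repeated index j'."""
    n = len(x_new)
    return [sum(A[i][j] * x_new[j] for j in range(n)) for i in range(n)]


def transform_lower(A, y_old):
    """y_{j'} = A^i_{j'} y_i: summation over the repeated index i."""
    n = len(y_old)
    return [sum(A[i][j] * y_old[i] for i in range(n)) for j in range(n)]


# A[i][j] holds the coefficient A^i_{j'} (illustrative values).
A = [[1, 1],
     [1, 2]]
x_new = [3, -1]        # components x^{j'} of a vector on the new basis
y_old = [5, -2]        # components y_i of a linear form on the old dual basis

x_old = transform_upper(A, x_new)
y_new = transform_lower(A, y_old)

print(contract(y_old, x_old), contract(y_new, x_new))
```

The same array A appears in both transformations, but the summation runs over its upper index in one case and over its lower index in the other. This is why the two sets of components change differently while their contraction does not change.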


15. Definition of a Euclidean vector space. Consider first the vector
space of elementary geometry. For each pair of vectors x, y the process
of scalar multiplication determines a number denoted by x·y which is
called their scalar product. Scalar multiplication has the following
properties:


(a) x·y = y·x (commutative property);

(b) (αx)·y = x·(αy) = α(x·y) (associative property with respect to
multiplication by a scalar α);

(c) x·(y + z) = x·y + x·z (distributive property with respect to
vector addition);

(d) If x·y = 0 for arbitrary x then y = 0.

Consider, in general, a vector space $E_n$, defined over the field of real
numbers. Suppose there exists a rule of composition which gives for
every pair of vectors x, y, a correspondence with a real number x·y
having the properties (a), (b), (c), (d).

We then say that $E_n$ is a Euclidean vector space and that the rules of
composition (a), (b), (c), (d), define scalar multiplication in that space.

It follows from (a), (b), (c), that the scalar product of two vectors is
a bilinear form with respect to these vectors; that is a linear form with
respect to either of them. From the commutative condition it follows
that this bilinear form is symmetric in the two vectors. We now
investigate how the scalar product may be expressed analytically.

Let the Euclidean space $E_n$ be referred to an arbitrary basis $(e_i)$ and
let

$$x = x^i e_i, \qquad y = y^j e_j$$

be any two vectors in $E_n$. Forming the scalar product and allowing
for (b) and (c) we find that

$$x \cdot y = x^i y^j \, e_i \cdot e_j.$$

We are thus led to consider the scalar products of pairs of basis
vectors, and find it convenient to write

$$e_i \cdot e_j = g_{ij}. \tag{15.1}$$

Due to the commutative property of the scalar product, the quantities
$g_{ij}$ are symmetric in their indices:

$$g_{ij} = g_{ji}.$$

Using this notation we have:

$$x \cdot y = g_{ij} x^i y^j.$$

It remains to consider the implications of (d). If x·y = 0 for any x
we have for arbitrary values of the $x^i$

$$g_{ij} x^i y^j = 0.$$

If, in this equation, we put successively $x^1, x^2, \ldots, x^n$ equal to one with
the remaining x's zero the following relations are obtained:

$$g_{ij} y^j = 0.$$

These constitute a system of n linear equations in n unknowns which
admit only the solution $y^j = 0$ according to (d). It follows that

$$\det(g_{ij}) \neq 0,$$

and the bilinear form is said to be non-degenerate. This may be
stated as follows:

THEOREM: The scalar product of two vectors in a Euclidean space
is given by the symmetric, non-degenerate, bilinear form

$$x \cdot y = g_{ij} x^i y^j, \tag{15.2}$$

where the $g_{ij}$ represent the scalar products $(e_i \cdot e_j)$ of the basis vectors.
It is clear that any sub-space of a Euclidean space is itself Euclidean.
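
Formulae (15.1) and (15.2) can be tried out on an oblique basis of the plane. In the following Python sketch (an illustration, not part of the original text) the basis vectors are given by their Cartesian components, which is an assumption made only so that the result can be compared with the elementary scalar product. The names dot, metric, scalar_product and combine are hypothetical.

```python
def dot(u, v):
    """Elementary scalar product of two arrays of Cartesian components."""
    return sum(a * b for a, b in zip(u, v))


def metric(basis):
    """g_ij = e_i . e_j, as in (15.1)."""
    return [[dot(ei, ej) for ej in basis] for ei in basis]


def scalar_product(g, x, y):
    """x . y = g_ij x^i y^j, summed over both i and j, as in (15.2)."""
    n = len(x)
    return sum(g[i][j] * x[i] * y[j] for i in range(n) for j in range(n))


def combine(coeffs, vectors):
    """Cartesian components of the vector with the given components x^i."""
    n = len(vectors[0])
    return tuple(sum(c * v[k] for c, v in zip(coeffs, vectors))
                 for k in range(n))


basis = [(2, 0), (1, 1)]          # an oblique basis (illustrative choice)
g = metric(basis)
det_g = g[0][0] * g[1][1] - g[0][1] * g[1][0]

x = [2, 1]                        # components x^i on this basis
y = [1, -1]                       # components y^j on this basis

print(g, det_g)
print(scalar_product(g, x, y), dot(combine(x, basis), combine(y, basis)))
```

The two numbers printed on the last line agree, since (15.2) is simply the elementary scalar product rewritten in terms of components on the oblique basis.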


16. Orthogonality and the norm. Let x and y be two vectors whose 
scalar product is zero. In the special case where the Euclidean space 
under consideration is that of elementary geometry, either x or y is 
zero or they are mutually perpendicular; in either case they are said 
to be orthogonal. In the general case of an n-dimensional Euclidean 
space x and y are again said to be orthogonal if

$$x \cdot y = 0. \tag{16.1}$$

The scalar product of the vector x by itself is called the norm (or
square) of the vector. We write

$$N x = (x)^2 = g_{ij} x^i x^j. \tag{16.2}$$

The norm of a vector x in Euclidean space is thus given in terms of
its components by the quadratic form (16.2). A vector whose norm is
equal to 1 is said to be unitary or normalized.

In the space of elementary geometry the norm of a vector is strictly
positive and it vanishes only if the vector is zero. If the coefficients $g_{ij}$
of the quadratic form (16.2) can take any value this is no longer true.
We are therefore led to the following definition:

A vector space is said to be properly Euclidean if it is Euclidean and
the norm is strictly positive for every non-zero vector.

In order that the quantity (16.2) be strictly positive for every non-zero
vector it is both necessary and sufficient that the quadratic form
$g_{ij} x^i x^j$ be positive definite.


17. The Schwarz inequality and its applications. Consider a proper
Euclidean space $P_n$. The norm Nx being always positive or zero, its
square root is termed the modulus of the vector and we write


a/(Nx) = |x|. 


A fundamental inequality exists between the moduli of two
vectors of $P_n$ and the modulus of their scalar product:

$|x \cdot y| \le |x|\,|y|$. (17.1)

This is known as the Schwarz inequality.
In order to establish this inequality we start with the vector $\lambda x + y$,
where $\lambda$ denotes an arbitrary real number. We have

$(\lambda x + y)^2 = \lambda^2 x^2 + 2\lambda\, x \cdot y + y^2$. (17.2)

Since the left-hand side is necessarily positive or zero, the discriminant
of the quadratic expression in $\lambda$ appearing on the right-hand side can
only be negative or zero. We therefore have the inequality

$(x \cdot y)^2 \le x^2 y^2$,

which is equivalent to (17.1).
An inequality relating to the modulus of the sum of two vectors can
easily be derived from (17.1). Putting $\lambda = 1$ in (17.2)

$(x + y)^2 = x^2 + 2\, x \cdot y + y^2$.

Bounding $x \cdot y$ from above by means of the Schwarz inequality
gives

$(x + y)^2 \le |x|^2 + 2|x|\,|y| + |y|^2 = (|x| + |y|)^2$,

hence

$|x + y| \le |x| + |y|$. (17.3)
Finally the Schwarz inequality enables us to define the angle
between two vectors of $P_n$. In elementary geometry the angle $\varphi$
between two vectors is related to their scalar product by

$x \cdot y = |x|\,|y| \cos\varphi$,

or $\cos\varphi = x \cdot y / (|x|\,|y|)$.

Returning to $P_n$, the Schwarz inequality shows that for two
non-zero vectors x, y of $P_n$

$|x \cdot y| / (|x|\,|y|) \le 1$.

It follows that in the range $(0, \pi)$ there is one, and only one, angle $\varphi$
such that

$\cos\varphi = \frac{x \cdot y}{|x|\,|y|}$.

This angle is by definition the angle between two vectors x and y of
$P_n$. If x and y are defined by their components it follows from (15.2)
and (16.2) that

$\cos\varphi = \frac{g_{ij} x^i y^j}{\sqrt{g_{ij} x^i x^j}\,\sqrt{g_{ij} y^i y^j}}$. (17.4)
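
The formulae (15.2), (16.2) and (17.4) can be tried out numerically. The following is a minimal sketch in plain Python, not part of the original text. The function names and the particular metric (a plane basis with $|e_1| = 1$, $|e_2| = 2$ and an angle of 60 degrees between the basis vectors) are assumptions chosen for illustration.

```python
import math

def dot(g, x, y):
    """Scalar product g_ij x^i y^j, as in (15.2)."""
    n = len(x)
    return sum(g[i][j] * x[i] * y[j] for i in range(n) for j in range(n))

def modulus(g, x):
    """Modulus |x|, the square root of the norm (16.2)."""
    return math.sqrt(dot(g, x, x))

def angle(g, x, y):
    """Angle between two non-zero vectors, from (17.4)."""
    c = dot(g, x, y) / (modulus(g, x) * modulus(g, y))
    # Clamp against rounding error so that acos stays defined.
    return math.acos(max(-1.0, min(1.0, c)))

# Assumed example metric: g_11 = 1, g_22 = 4, g_12 = 1 * 2 * cos(60 deg) = 1.
g = [[1.0, 1.0],
     [1.0, 4.0]]
x = [1.0, 0.0]   # the basis vector e_1
y = [0.0, 1.0]   # the basis vector e_2

phi = angle(g, x, y)
s = [a + b for a, b in zip(x, y)]
schwarz_holds = abs(dot(g, x, y)) <= modulus(g, x) * modulus(g, y)      # (17.1)
triangle_holds = modulus(g, s) <= modulus(g, x) + modulus(g, y)         # (17.3)
```

With this metric the computed angle agrees with the 60 degrees used to build the $g_{ij}$, and both inequalities hold, as they must in a proper Euclidean space.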


18. Orthonormal systems of vectors. Let $P_n$ be a proper Euclidean
n-dimensional space. A system of r vectors of $P_n$ is said to be
orthogonal and normalized (or, more briefly, orthonormal) if all the vectors
in the system are normalized and mutually orthogonal. If $(e_1, e_2, \dots, e_r)$
are the vectors of the system then

$e_i \cdot e_j = \delta_{ij}$ $(i, j = 1, 2, \dots, r)$, (18.1)

where $\delta_{ij} = 0$ if $i \neq j$,
      $\delta_{ij} = 1$ if $i = j$.

It is clear that every orthonormal system of vectors is necessarily
linearly independent. If this were not true there would exist a relation
of the form

$\sum_{i=1}^{r} \alpha_i e_i = 0$, (18.2)

where the $\alpha_i$ were not all zero. Suppose $\alpha_1$, for example, to be different
from zero. Taking the scalar product of the members of (18.2) with
$e_1$, we find using (18.1) that

$\alpha_1 = 0$,

which is contrary to our hypothesis. It follows that the number of
vectors of an orthonormal system is less than or equal to the number
of dimensions n of the space. If r = n the orthonormal system under
consideration forms an orthonormal basis of $P_n$.


19. The Schmidt orthogonalization procedure. One can ask whether,
given an arbitrary integer $r \le n$, orthonormal systems composed of r
vectors can be found. We shall now show, using a method due to E.
Schmidt, that such systems can indeed be found.

Let $(x_1, x_2, \dots, x_r)$ be a linearly independent system of order r in
$P_n$ and let $U_r$ be the corresponding vector sub-space. It is always
possible to construct an orthonormal system whose vectors are
linear combinations of the vectors $x_i$ in $U_r$. To this end we define the
set of vectors $y_i$ as follows:

$y_1 = x_1$,
$y_2 = \lambda_2^1 y_1 + x_2$,
$y_3 = \lambda_3^1 y_1 + \lambda_3^2 y_2 + x_3$,
...
$y_r = \lambda_r^1 y_1 + \lambda_r^2 y_2 + \dots + \lambda_r^{r-1} y_{r-1} + x_r$,

where the $\lambda_i^j$ denote coefficients which we propose to determine in
such a way that each $y_i$ is orthogonal to all the preceding $y_j$'s. Using
the relation

$y_2 \cdot y_1 = 0$

we deduce

$\lambda_2^1 y_1^2 + x_2 \cdot y_1 = 0$ where $y_1^2 \neq 0$.

This determines $\lambda_2^1$ and we obtain a $y_2$ which is both orthogonal to
$y_1$ and non-zero, because the system $(y_1, x_2, \dots, x_r)$ is linearly
independent.

Using the relations

$y_3 \cdot y_1 = 0$,    $y_3 \cdot y_2 = 0$,

we obtain

$\lambda_3^1 y_1^2 + x_3 \cdot y_1 = 0$ with $y_1^2 \neq 0$;
$\lambda_3^2 y_2^2 + x_3 \cdot y_2 = 0$ with $y_2^2 \neq 0$.

This determines a vector $y_3$ which is orthogonal to both $y_1$ and $y_2$. It
is also non-zero because the system $(y_1, y_2, x_3, \dots, x_r)$ is linearly
independent according to the definition of $y_2$.

Continuing in this way a system of vectors $(y_1, y_2, \dots, y_r)$ is
obtained, none of which is zero and which are all mutually orthogonal.
Dividing each of these vectors by its modulus we obtain the required
orthonormal system of vectors

$e_i = \frac{y_i}{|y_i|}$.
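
The construction above translates almost line by line into a short program. The sketch below is an illustration only; the function names and the example metric $g_{ij}$ are assumptions, not taken from the text. Each coefficient is obtained from the condition $\lambda\, y_k^2 + x \cdot y_k = 0$, exactly as in the derivation, and exact rational arithmetic is used for the orthogonal system so that orthogonality can be checked without rounding.

```python
import math
from fractions import Fraction

def dot(g, x, y):
    """Scalar product g_ij x^i y^j of two vectors given by components."""
    n = len(x)
    return sum(g[i][j] * x[i] * y[j] for i in range(n) for j in range(n))

def schmidt_orthogonal(g, xs):
    """Build mutually orthogonal vectors y_1, ..., y_r from independent xs."""
    ys = []
    for x in xs:
        y = list(x)
        for prev in ys:
            # Solve lam * prev^2 + x . prev = 0 for the coefficient lam.
            lam = -dot(g, x, prev) / dot(g, prev, prev)
            y = [a + lam * b for a, b in zip(y, prev)]
        ys.append(y)
    return ys

def normalize(g, y):
    """Divide a non-zero vector by its modulus."""
    m = math.sqrt(dot(g, y, y))
    return [float(c) / m for c in y]

# Assumed positive definite metric of an oblique plane basis.
g = [[Fraction(1), Fraction(1)],
     [Fraction(1), Fraction(4)]]
xs = [[1, 0], [0, 1]]          # the basis vectors e_1, e_2 themselves

ys = schmidt_orthogonal(g, xs)  # orthogonal, not yet normalized
es = [normalize(g, y) for y in ys]
```

Here $y_2 = x_2 - y_1$, whose norm is 3, so the second orthonormal vector is $y_2 / \sqrt{3}$.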

20. The space $P_n$ referred to an orthonormal basis. Let a proper
Euclidean space $P_n$ be referred to an orthonormal basis $(e_1, e_2, \dots, e_n)$
so that

$e_i \cdot e_j = \delta_{ij}$. (20.1)

Writing

$x = x^j e_j$

and taking the scalar product of each side with $e_i$, we find

$x^i = x \cdot e_i$. (20.2)

Since the basis is orthonormal the scalar product of two vectors is
given by

$x \cdot y = x^1 y^1 + x^2 y^2 + \dots + x^n y^n$, (20.3)

and the norm of a vector becomes

$N x = (x^1)^2 + (x^2)^2 + \dots + (x^n)^2$. (20.4)

Equations (20.3) and (20.4) will be recognized as the generalization
to n dimensions of the familiar formulae of elementary vector
analysis.

We note, moreover, that it is always possible to bring a given
positive definite form $g_{ij} x^i x^j$ into the form of (20.4) by a suitable
change of basis.


21. Contravariant and covariant components of a vector. Let $E_n$ denote
a Euclidean vector space which is referred to an arbitrary basis
$(e_1, e_2, \dots, e_n)$. We have seen in the preceding section that, if this basis
is orthonormal, the components $x^i$ of the vector x are equal to the
scalar products of x by the vectors of the basis. If the basis is not
orthonormal this is no longer true. We are thus led to the following
definition.

DEFINITION: Given an arbitrary basis $(e_1, e_2, \dots, e_n)$ of the
Euclidean vector space $E_n$,

(1) the term contravariant components of a vector x referred to this
basis is applied to the numbers $x^i$ which are such that

$x = x^i e_i$; (21.1)

(2) the term covariant components of a vector x referred to this basis
is applied to the scalar products

$x_i = x \cdot e_i$. (21.2)

In what follows, contravariant components are always represented
by means of superscripts and covariant ones by subscripts. The
reasons for this distinction will appear shortly (§23).

The covariant components of a vector may be determined readily
from a knowledge of its contravariant components. From (21.1) we
have

$x = x^j e_j$.

Forming the scalar product of each side with $e_i$ it follows that

$x_i = (e_i \cdot e_j) x^j$,

which becomes, on introducing the $g_{ij}$,

$x_i = g_{ij} x^j$ $(i = 1, 2, \dots, n)$. (21.3)

Conversely, let us seek to express the contravariant components of
a vector in terms of its covariant components. We require the solution
of the system of linear equations in the n unknowns $x^j$ given by

$g_{ij} x^j = x_i$. (21.4)

From the considerations of §15 the determinant of the $g_{ij}$ is different
from zero and the system (21.4) is therefore a Cramer's system.† In
the following we write the determinant of the $g_{ij}$ as g and the coefficient
of $g_{ij}$ in the development of g as $\alpha^{ij}$. Using Cramer's rule, it follows
that

$x^j = \frac{\alpha^{ji}}{g} x_i$.

Writing

$g^{ji} = \frac{\alpha^{ji}}{g}$ (21.5)

we get the fundamental relations

$x^j = g^{ji} x_i$. (21.6)

† A system of linear equations with a unique solution. (T.)

Since the determinant formed from the elements $g_{ij}$ is symmetric,
we have $\alpha^{ji} = \alpha^{ij}$ and consequently the quantities $g^{ij}$ defined by (21.5)
are also symmetric with respect to their indices. Moreover, from a
well-known result of the theory of determinants

$\det \alpha^{ij} = g^{n-1}$,

hence

$\det g^{ij} = \frac{1}{g}$. (21.7)
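
The passage between the two kinds of components can be checked on a small example. The sketch below is illustrative only: the helper names and the sample metric are assumptions. It follows (21.3) for lowering an index and builds the $g^{ij}$ from cofactors as in (21.5), using exact fractions.

```python
from fractions import Fraction

def det(m):
    """Determinant by expansion along the first row (small matrices only)."""
    if len(m) == 0:
        return 1
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j in range(len(m)):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total

def cofactor(m, i, j):
    """Coefficient of m[i][j] in the development of det(m)."""
    minor = [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]
    return (-1) ** (i + j) * det(minor)

def inverse_metric(g):
    """The quantities g^{ij} = alpha^{ij} / g of (21.5)."""
    n, d = len(g), det(g)
    return [[Fraction(cofactor(g, i, j), d) for j in range(n)] for i in range(n)]

def lower(g, x_contra):
    """x_i = g_ij x^j, formula (21.3)."""
    n = len(g)
    return [sum(g[i][j] * x_contra[j] for j in range(n)) for i in range(n)]

def raise_index(g_inv, x_cov):
    """x^j = g^{ji} x_i, formula (21.6)."""
    n = len(g_inv)
    return [sum(g_inv[j][i] * x_cov[i] for i in range(n)) for j in range(n)]

g = [[1, 1],
     [1, 4]]                    # assumed example metric, det g = 3
x_contra = [2, 3]
x_cov = lower(g, x_contra)
g_inv = inverse_metric(g)
x_back = raise_index(g_inv, x_cov)
norm = sum(a * b for a, b in zip(x_contra, x_cov))   # N x = x^i x_i
```

Lowering and then raising returns the original components, and the determinant of the $g^{ij}$ comes out as $1/g$, in agreement with (21.7).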


22. Expressions for scalar product and norm in terms of covariant
components. The use of covariant components leads to particularly
simple expressions for the scalar product and norm in terms of the
components referred to an arbitrary basis.

Substituting (21.3) into the relation

$x \cdot y = g_{ij} x^i y^j$

it follows that

$x \cdot y = x_i y^i = x^i y_i$. (22.1)

We also find, for the norm of a vector x,

$N x = x^i x_i$. (22.2)

On the other hand, the scalar product of two vectors and the norm
of a vector can be expressed solely in terms of the covariant
components of the vectors. Using (21.6) we have

$x \cdot y = g^{ij} x_i y_j$ (22.3)

and

$N x = g^{ij} x_i x_j$. (22.4)

Because of (21.7) the bilinear form (22.3) and the quadratic form
(22.4) are both non-degenerate.


23. The effect of a change of basis on the contravariant and covariant
components of a vector. We have just seen that if the Euclidean vector
space is referred to an arbitrary basis the scalar product of two
vectors x, y can be expressed in the form

$x \cdot y = x_i y^i$.

If x is a constant vector, the scalar product $x \cdot y$ defines a linear
form with respect to the arbitrary vector y. This linear form has the
components $x_i$ relative to the basis $(y^1, y^2, \dots, y^n)$ which is dual to
$(e_1, e_2, \dots, e_n)$.

Consider a change in the basis of $E_n$ defined by

(a) $e_i = A_i^{j'} e_{j'}$,    (b) $e_{j'} = A_{j'}^{i} e_i$. (23.1)

We have seen that the contravariant components $x^i$ of a vector
transform according to the formulae

(a) $x^i = A_{j'}^{i} x^{j'}$,    (b) $x^{j'} = A_i^{j'} x^i$. (23.2)

As for the covariant components $x_i$, we have seen that these transform
according to the formulae (12.3) for the components of a linear form,
thus

(a) $x_i = A_i^{j'} x_{j'}$,    (b) $x_{j'} = A_{j'}^{i} x_i$. (23.3)

It will be noted that the linear expressions appearing on the
right-hand side of (23.1) (a) and (23.3) (a) contain the same coefficients.
This is also true of (23.1) (b) and (23.3) (b). On the other hand, (23.2)
(a) and (23.1) (b) have a similar form. These similarities justify our
use of the terms covariant and contravariant.
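
A small numerical experiment makes the two transformation laws concrete. The sketch below is an illustration under stated assumptions: the matrix `A` (holding the coefficients $A_i^{j'}$), its inverse `B` (holding the $A_{j'}^{i}$) and the sample components are all chosen arbitrarily. It applies (23.2) (b) and (23.3) (b) and confirms that the expression $x_i y^i$ keeps its value.

```python
from fractions import Fraction as F

n = 2
# A[i][j] holds A_i^{j'}: the old vector e_i in the new basis, (23.1)(a).
A = [[F(1), F(1)],
     [F(0), F(2)]]
# B[j][i] holds A_{j'}^i: the new vector e_{j'} in the old basis, (23.1)(b).
B = [[F(1), F(-1, 2)],
     [F(0), F(1, 2)]]

# The two coefficient arrays must be inverse to each other.
product = [[sum(A[i][j] * B[j][k] for j in range(n)) for k in range(n)]
           for i in range(n)]

def contravariant_to_new(x):
    """x^{j'} = A_i^{j'} x^i, formula (23.2)(b)."""
    return [sum(A[i][j] * x[i] for i in range(n)) for j in range(n)]

def covariant_to_new(u):
    """x_{j'} = A_{j'}^i x_i, formula (23.3)(b)."""
    return [sum(B[j][i] * u[i] for i in range(n)) for j in range(n)]

def pairing(u, y):
    """The expression x_i y^i."""
    return sum(a * b for a, b in zip(u, y))

x_cov = [F(5), F(14)]     # covariant components x_i (assumed values)
y_con = [F(2), F(3)]      # contravariant components y^i (assumed values)

x_cov_new = covariant_to_new(x_cov)
y_con_new = contravariant_to_new(y_con)
before = pairing(x_cov, y_con)
after = pairing(x_cov_new, y_con_new)
```

The covariant components are transformed with the same coefficients as the basis vectors, the contravariant ones with the inverse coefficients, and the two effects cancel in $x_i y^i$.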


24. Euclidean vector spaces and duality. If x is a vector in the Euclidean
vector space $E_n$ we have seen that the scalar product $x \cdot y$, where y is an
arbitrary vector, enables a correspondence to be set up between the
vector x and a linear form defined on $E_n$. Conversely, any linear form
defined on $E_n$ can be considered as the scalar product of a fixed
vector x with an arbitrary vector of $E_n$.
If the space $E_n$ is referred to an arbitrary basis $(e_i)$ and the dual
space $E_n^*$ to the dual basis $(x^i)$, the element in $E_n$ which has the
components $y^i$ corresponds to the element in $E_n^*$ with the components $y_i$.
These components are linked by the relations (21.3) and (21.6), i.e.

$y_i = g_{ij} y^j$,    $y^i = g^{ij} y_j$. (24.1)

Thus, the existence of the scalar product enables us to set up a
one-to-one correspondence between the elements of $E_n$ and $E_n^*$, and the
components of these elements bear a simple relationship to one
another. The Euclidean vector space $E_n$ and its dual $E_n^*$ are therefore
considered to be equivalent.


CHAPTER II 


Affine Euclidean Point Spaces 


25. Definition of an affine space. The points of the space $\mathcal{E}$ of
elementary geometry define vectors, the position vectors of elementary
vector analysis. These obviously satisfy the relations

$\overrightarrow{AB} = -\overrightarrow{BA}$,
$\overrightarrow{AB} = \overrightarrow{AC} + \overrightarrow{CB}$.

Choose an arbitrary point O in $\mathcal{E}$; then each point A of $\mathcal{E}$ is
associated with a displacement vector a defined by

$a = \overrightarrow{OA}$.

Conversely, given an arbitrary displacement vector a, there exists a
point A such that

$\overrightarrow{OA} = a$.

In general, consider a manifold of points, $\mathcal{E}$, and suppose that to
each pair (A, B) of points of $\mathcal{E}$ taken in order there corresponds a
vector of an n-dimensional vector space $E_n$, denoted by $\overrightarrow{AB}$. Assume
this correspondence to have the following properties:

(a) $\overrightarrow{AB} = -\overrightarrow{BA}$;
(b) $\overrightarrow{AB} = \overrightarrow{AC} + \overrightarrow{CB}$;
(c) if O is an arbitrary point in $\mathcal{E}$ then, to every vector a of $E_n$,
there corresponds a unique point A such that

$\overrightarrow{OA} = a$.

When these conditions hold we say that the manifold $\mathcal{E}$ is an affine
point space of n dimensions. We shall denote it henceforth by the
symbol $\mathcal{E}_n$. The space $\mathcal{E}_n$ will be termed real or complex according to
whether the associated vector space $E_n$ is defined over the real or
complex field. We shall limit future considerations to real affine
spaces.

Given a vector denoted by $\overrightarrow{AB}$ we say that A is the origin and B the
extremity. Condition (c) consequently states that, for all a, the
specification of a in $E_n$ and an origin O in $\mathcal{E}_n$ completely determines the
extremity A.

From the above hypotheses it is clear that

$\overrightarrow{AA} = 0$.


26. Reference frames of affine spaces. DEFINITION: Given an affine
space $\mathcal{E}_n$, then an arbitrary point O of $\mathcal{E}_n$ and a basis $(e_1, e_2, \dots, e_n)$ of
the associated vector space together constitute a reference frame of $\mathcal{E}_n$.
The point O is called the origin of the reference frame.

A reference frame will henceforward be represented by the
notation $(O, e_1, e_2, \dots, e_n)$ or, more concisely, by $(O, e_i)$.

Let A denote an arbitrary point of $\mathcal{E}_n$. The components of the
vector $\overrightarrow{OA}$ with respect to the basis $(e_i)$ are called the coordinates of A
in the reference frame $(O, e_i)$. By virtue of (c) there is a one-to-one
correspondence between the sets of n real numbers $(x^1, x^2, \dots, x^n)$ and the
points A of $\mathcal{E}_n$.

Given two points A and B of $\mathcal{E}_n$ defined by their coordinates $(x^i)$
and $(y^i)$ in the reference frame $(O, e_i)$, we wish to define the vector $\overrightarrow{AB}$.
From (a) and (b) we have

$\overrightarrow{AB} = \overrightarrow{AO} + \overrightarrow{OB} = \overrightarrow{OB} - \overrightarrow{OA}$.

From this we deduce that the vector $\overrightarrow{AB}$ has the n quantities $(y^i - x^i)$
as components with respect to the basis $(e_i)$.


27. Change of reference frame. We wish to determine the relations
that exist between the coordinates of a given point M of $\mathcal{E}_n$ referred to
two distinct frames.
Let $(O, e_i)$ and $(O', e_{i'})$ be two reference frames of $\mathcal{E}_n$. In order to
locate each frame with respect to the other we refer the origin of each
frame to the other and the vectors of each basis to the other basis.
We have

$\overrightarrow{OO'} = a^i e_i$,    $\overrightarrow{O'O} = a^{i'} e_{i'}$;
$e_{i'} = A_{i'}^{j} e_j$,    $e_i = A_i^{j'} e_{j'}$.

Let M be an arbitrary point of $\mathcal{E}_n$ whose coordinates are $x^i$ in the
frame $(O, e_i)$ and $x^{i'}$ in $(O', e_{i'})$. We have

$\overrightarrow{OM} = x^j e_j$, (27.1)

also

$\overrightarrow{OM} = \overrightarrow{OO'} + \overrightarrow{O'M} = a^j e_j + x^{i'} e_{i'} = (a^j + A_{i'}^{j} x^{i'}) e_j$. (27.2)

Identifying the right-hand sides of (27.1) and (27.2), and comparing
coefficients of $e_j$, we get the transformation formulae

$x^j = a^j + A_{i'}^{j} x^{i'}$. (27.3)

Interchanging the roles of the two reference frames we can also
write

$x^{j'} = a^{j'} + A_i^{j'} x^i$. (27.4)
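
The two transformation formulae can be exercised on a concrete pair of frames. The following sketch is illustrative only; the arrays `A` and `B`, the displacement of the origin and the sample point are assumed values. It uses the fact that $\overrightarrow{O'O} = -\overrightarrow{OO'}$ to obtain the $a^{j'}$ from the $a^i$.

```python
from fractions import Fraction as F

n = 2
A = [[F(1), F(1)],          # A[i][j] = A_i^{j'}   (e_i in terms of the e_{j'})
     [F(0), F(2)]]
B = [[F(1), F(-1, 2)],      # B[i][j] = A_{i'}^j   (e_{i'} in terms of the e_j)
     [F(0), F(1, 2)]]

a_old = [F(1), F(1)]        # components a^i of OO' in the basis (e_i)
# O'O = -OO', hence a^{j'} = -A_i^{j'} a^i.
a_new = [-sum(A[i][j] * a_old[i] for i in range(n)) for j in range(n)]

def new_to_old(x_new):
    """x^j = a^j + A_{i'}^j x^{i'}, formula (27.3)."""
    return [a_old[j] + sum(B[i][j] * x_new[i] for i in range(n))
            for j in range(n)]

def old_to_new(x_old):
    """x^{j'} = a^{j'} + A_i^{j'} x^i, formula (27.4)."""
    return [a_new[j] + sum(A[i][j] * x_old[i] for i in range(n))
            for j in range(n)]

m_new = [F(2), F(8)]        # coordinates of a point M in the frame (O', e_{i'})
m_old = new_to_old(m_new)   # coordinates of the same point in (O, e_i)
round_trip = old_to_new(m_old)
origin_check = old_to_new(a_old)   # the point O' itself, seen from (O', e_{i'})
```

Applying (27.3) and then (27.4) returns the starting coordinates, and the point O' has zero coordinates in its own frame.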


28. Affine sub-spaces. DEFINITION: Let $\mathcal{V}$ be a part of an affine
space $\mathcal{E}_n$ such that for any point O in $\mathcal{V}$ the set of vectors $\overrightarrow{OM}$ (all M
in $\mathcal{V}$) constitutes a vector sub-space of $E_n$. $\mathcal{V}$ is then said to form an
affine sub-space of $\mathcal{E}_n$.
A sufficient condition for $\mathcal{V}$ to be an affine sub-space is that, for a
particular point O, the vectors $\overrightarrow{OM}$ constitute a vector sub-space
$V_r$ of $E_n$. If O' is any other point in $\mathcal{V}$

$\overrightarrow{O'M} = \overrightarrow{OM} - \overrightarrow{OO'}$

and since $\overrightarrow{OM}$ and $\overrightarrow{OO'}$ both belong to $V_r$, so must $\overrightarrow{O'M}$. Conversely,
given a vector a of $V_r$, there exists a unique point M belonging to $\mathcal{V}$
which is such that

$\overrightarrow{OM} = \overrightarrow{OO'} + a$,

which is to say $\overrightarrow{O'M} = a$.

It is clear from this definition that if the vector sub-space is
r-dimensional then the corresponding affine sub-space is an r-dimensional
affine space. We denote this by $\mathcal{V}_r$.

As an example, consider the three-dimensional affine point
space $\mathcal{E}_3$ of elementary geometry. The one- and two-dimensional
affine sub-spaces are then simply the lines and planes of $\mathcal{E}_3$.


29. Euclidean point spaces. DEFINITION: An affine point space which
is associated with a Euclidean vector space is called a Euclidean point
space.

If the associated vector space $E_n$ is properly Euclidean the space
$\mathcal{E}_n$ is also said to be properly Euclidean. Furthermore, the reference
frame $(O, e_i)$ is orthonormal if the basis $(e_i)$ of $E_n$ is orthonormal.

The point space $\mathcal{E}_3$ of elementary geometry is an obvious example
of a three-dimensional proper Euclidean point space.

The concept of distance may be introduced through the following
definition: The square of the distance between two points A and B in
$\mathcal{E}_n$ is the norm of the vector $\overrightarrow{AB}$ (or of $\overrightarrow{BA}$) so that

$N(\overrightarrow{AB}) = (\overrightarrow{AB})^2 = (AB)^2$.

If the space is properly Euclidean, $(AB)^2$ is strictly positive for two
distinct points A, B so that the square root is always real and defines
the distance AB between the points A and B. If A, B, C denote any
three points in a proper Euclidean space we have

$\overrightarrow{AC} = \overrightarrow{AB} + \overrightarrow{BC}$,

and applying the inequality (17.3) it follows that

$AC \le AB + BC$. (29.1)

This is known as the triangular inequality.

Let us return to the general case of a Euclidean point space which
we suppose to be referred to an arbitrary frame $(O, e_i)$. We wish to
express analytically the square of the distance between two points.

If M has the coordinates $(x^i)$ and N the coordinates $(y^i)$ the vector $\overrightarrow{MN}$
has the components $(y^i - x^i)$. Accordingly, in terms of the coefficients
$g_{ij}$ of the scalar product,

$(MN)^2 = (\overrightarrow{MN})^2 = g_{ij} (y^i - x^i)(y^j - x^j)$. (29.2)

Suppose the point N to be in the immediate neighbourhood of M
and let its coordinates be $(x^i + dx^i)$. The square $(ds)^2$ of the distance
between M and N is therefore given by the quadratic differential form

$(ds)^2 = g_{ij}\, dx^i\, dx^j$, (29.3)

where the $g_{ij}$ are constants with respect to the $(x^i)$. It is clear, in
particular, that if $\mathcal{E}_n$ is properly Euclidean and is referred to an
orthonormal reference system, then

$(ds)^2 = (dx^1)^2 + (dx^2)^2 + \dots + (dx^n)^2$. (29.4)

The reader will recognize this as the generalization to n dimensions of
the expression for $(ds)^2$ in terms of orthogonal coordinates in
elementary geometry.


CHAPTER III 


Tensor Algebra 


I. CONCEPT OF A TENSOR PRODUCT 


30. Tensor product of two spaces. Let us consider two vector spaces
$E_n$ and $F_p$ of n and p dimensions respectively and associate with them
a vector space of dimensionality np denoted by $E_n \otimes F_p$. If x and y
belong to $E_n$ and $F_p$ respectively we can set up a correspondence
between the pair (x, y) and an element of the vector space $E_n \otimes F_p$
denoted by $x \otimes y$. This correspondence has the following properties:

(a) If $x, x_1, x_2$ belong to $E_n$, and $y, y_1, y_2$ to $F_p$, then the distributive
law holds with respect to vector addition:

$x \otimes (y_1 + y_2) = x \otimes y_1 + x \otimes y_2$,
$(x_1 + x_2) \otimes y = x_1 \otimes y + x_2 \otimes y$.

(b) If $\alpha$ is an arbitrary scalar, the associative law holds:

$\alpha x \otimes y = x \otimes \alpha y = \alpha (x \otimes y)$.

(c) If $(x_1, x_2, \dots, x_n)$ and $(y_1, y_2, \dots, y_p)$ are any two bases of $E_n$ and
$F_p$ respectively the np elements

$x_i \otimes y_\alpha$ $(i = 1, 2, \dots, n;\ \alpha = 1, 2, \dots, p)$

of $E_n \otimes F_p$ form a basis in that space.

When these conditions hold we say that the vector space $E_n \otimes F_p$ is
the tensor product of the vector spaces $E_n$ and $F_p$, and that the element
$x \otimes y$ is the tensor product of the two vectors x and y.


31. Analytical expression for the tensor product of two vectors. Let
us see how the three properties given above allow the formulation
of a rule of composition for the two vectors x and y. Choose any
three bases $(e_i)$, $(f_\alpha)$, $(\varepsilon_{i\alpha})$ in the vector spaces $E_n$, $F_p$ and $E_n \otimes F_p$
$(i = 1, 2, \dots, n;\ \alpha = 1, 2, \dots, p)$. According to (c) it is possible to write

$\varepsilon_{i\alpha} = e_i \otimes f_\alpha$. (31.1)

Suppose that

$x = x^i e_i$,    $y = y^\alpha f_\alpha$ (31.2)

are two arbitrary vectors belonging respectively to $E_n$ and $F_p$. Taking
the tensor product of the right-hand sides of the equations (31.2) and
using properties (a) and (b) we get

$x \otimes y = x^i y^\alpha\, e_i \otimes f_\alpha = x^i y^\alpha\, \varepsilon_{i\alpha}$. (31.3)

It follows that the quantities $x^i y^\alpha$ are the components of the tensor
product $x \otimes y$ with respect to the basis $\varepsilon_{i\alpha}$.

On the other hand, does the composition rule defined by (31.3)
satisfy property (c)? Let $(x_j)$ and $(y_\beta)$ be two arbitrary bases of $E_n$ and
$F_p$; referring the vectors $e_i$ to the basis $(x_j)$ we have

$e_i = a_i^j x_j$.

Similarly

$f_\alpha = b_\alpha^\beta y_\beta$.

Each element T of $E_n \otimes F_p$ can be written

$T = t^{i\alpha} \varepsilon_{i\alpha} = t^{i\alpha}\, e_i \otimes f_\alpha$, (31.4)
$T = t^{i\alpha}\, a_i^j x_j \otimes b_\alpha^\beta y_\beta = t^{i\alpha} a_i^j b_\alpha^\beta\, x_j \otimes y_\beta$.

Therefore T can be expressed as a linear combination of the
elements $x_j \otimes y_\beta$. If T = 0, equation (31.4) shows that $t^{i\alpha} = 0$, and the
system of np elements $x_j \otimes y_\beta$ is linearly independent, which shows
that (c) is satisfied. This can be expressed formally as follows:

THEOREM: If the spaces $E_n$, $F_p$, $E_n \otimes F_p$ are referred to certain
bases related according to (31.1) then the only composition rule
satisfying the properties of §30 is that which, for a vector x with components
$x^i$ and a vector y with components $y^\alpha$, gives a correspondence with
the element of $E_n \otimes F_p$ having components $x^i y^\alpha$.
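
In components the rule of the theorem is simply the array of all products $x^i y^\alpha$. The sketch below is an illustration; the function names and sample vectors are assumptions. It forms that array and checks properties (a) and (b) of §30 on a numerical example.

```python
def tensor_product(x, y):
    """Components x^i y^alpha of x (x) y, arranged as an n-by-p array."""
    return [[xi * ya for ya in y] for xi in x]

def add(s, t):
    """Component-wise sum of two arrays of the same shape."""
    return [[a + b for a, b in zip(rs, rt)] for rs, rt in zip(s, t)]

def scale(c, t):
    """Multiplication of every component by the scalar c."""
    return [[c * a for a in row] for row in t]

x = [1, 2]            # a vector of E_2 (assumed components)
y1 = [3, 4, 5]        # two vectors of F_3 (assumed components)
y2 = [1, 0, -1]

p = tensor_product(x, y1)

# Property (a): distributivity with respect to vector addition.
lhs_a = tensor_product(x, [a + b for a, b in zip(y1, y2)])
rhs_a = add(tensor_product(x, y1), tensor_product(x, y2))

# Property (b): a scalar factor may be attached to either vector.
lhs_b = tensor_product([3 * a for a in x], y1)
mid_b = tensor_product(x, [3 * a for a in y1])
rhs_b = scale(3, p)
```

The array `p` has $np = 6$ entries, one for each basis element $\varepsilon_{i\alpha}$.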


32. Tensor products of several spaces. Tensors. Consider three vector
spaces $E_n$, $F_p$, $G_q$ with n, p, q dimensions respectively. If x belongs to
$E_n$, y to $F_p$, z to $G_q$ then the element $x \otimes y$ of $E_n \otimes F_p$ can be multiplied
tensorially with the element z of $G_q$. The element $[x \otimes y] \otimes z$ of a
vector space H is thus obtained. We assume that the same element of
H is obtained by taking the tensor product of x with $y \otimes z$, i.e.

$[x \otimes y] \otimes z = x \otimes [y \otimes z]$, (32.1)

which is the associative property of tensor products. It is only
necessary to assume this property for the basis vectors, then (32.1) follows
because of the distributive properties (a) and (b) above. We denote the
common value of the two sides of (32.1) by $x \otimes y \otimes z$ and the vector
space H will be represented by $E_n \otimes F_p \otimes G_q$.

Given a finite number r of vector spaces $E_n$, $F_p$, $G_q$, ..., the definition
of their tensor product follows immediately. Since an element of
$E_n \otimes F_p \otimes G_q \otimes \dots$ is not necessarily the tensor product of r vectors
belonging respectively to $E_n$, $F_p$, $G_q$, ...,† we formulate the following
general definition:

DEFINITION: Each element of the vector space

$E_n \otimes F_p \otimes G_q \otimes \dots$

constructed from the spaces $E_n$, $F_p$, $G_q$, ... is called a tensor.


II. AFFINE TENSORS


33. Affine tensors attached to a vector space. Given an n-dimensional
space $E_n$ it is possible to construct the tensor product of q spaces
identical with $E_n$. The vector space of $n^q$ dimensions thus obtained
will be denoted by $E_n^{(q)}$ and called the qth tensorial power of $E_n$. As the
vector spaces $E_n$ and $E_n^{(q)}$ are referred respectively to the basis vectors
$(e_i)$ and $(\varepsilon_{i_1 i_2 \dots i_q})$ we adopt the following convention which generalizes
(31.1):

$e_{i_1} \otimes e_{i_2} \otimes \dots \otimes e_{i_q} = \varepsilon_{i_1 i_2 \dots i_q}$.

If $x_{(1)}, x_{(2)}, \dots, x_{(q)}$ denote any q vectors of $E_n$ with components
$x_{(1)}^{i_1}, x_{(2)}^{i_2}, \dots, x_{(q)}^{i_q}$ respectively, then

$x_{(1)} \otimes x_{(2)} \otimes \dots \otimes x_{(q)} = x_{(1)}^{i_1} x_{(2)}^{i_2} \dots x_{(q)}^{i_q}\, \varepsilon_{i_1 i_2 \dots i_q}$. (33.1)

† It may be the sum of several such expressions. (T.)

Consider the vector space $E_n^*$ dual to $E_n$, referred to the basis $(x^i)$
dual to $(e_i)$. Starting from these two spaces we can carry out the
following operation: take arbitrary tensor powers of $E_n$ or $E_n^*$ and
multiply them tensorially amongst themselves. We then obtain
tensor products of the type

$E_n^{(r_1)} \otimes E_n^{*(s_1)} \otimes E_n^{(r_2)} \otimes \dots \otimes E_n^{(r_m)} \otimes E_n^{*(s_m)}$. (33.2)

DEFINITION: A tensor is said to be an affine tensor attached to the
vector space $E_n$ if it is an element of a vector space obtained by
forming a tensor product of spaces identical with $E_n$ or its dual.

If $(r_1 + r_2 + \dots + r_m) + (s_1 + s_2 + \dots + s_m) = q$, the vector space
(33.2) is $n^q$-dimensional. Each element of (33.2) is thus called an affine
tensor of order† q; it is $(r_1 + r_2 + \dots + r_m)$-fold contravariant and
$(s_1 + s_2 + \dots + s_m)$-fold covariant. Each element of $E_n^{(q)}$ is a
contravariant affine tensor of order q and each element of $E_n^{*(q)}$ is a covariant
affine tensor of order q. In particular it is often said that the elements
of $E_n$ are contravariant vectors whilst those of $E_n^*$ are covariant
vectors. Naturally this terminology varies according to which of the
spaces $E_n$ and $E_n^*$ is considered to be given first.


34. Components of an affine tensor. In order to represent the
components of different affine tensors it is convenient to introduce a special
notation which will now be explained. We consider, for example, a
tensor of order q which is an element of the space $E_n^{(q-2)} \otimes E_n^{*(2)}$. A
basis of this vector space is formed by the set of elements

$e_{i_1} \otimes e_{i_2} \otimes \dots \otimes e_{i_{q-2}} \otimes x^{i_{q-1}} \otimes x^{i_q} = \varepsilon_{i_1 i_2 \dots i_{q-2}}{}^{i_{q-1} i_q}$, (34.1)

where both sides have the same suffixes and superfixes. Let T be a
tensor element of $E_n^{(q-2)} \otimes E_n^{*(2)}$ and refer it to the basis (34.1). In
order to preserve the summation convention we represent the
components of T by $t^{i_1 i_2 \dots i_{q-2}}{}_{i_{q-1} i_q}$ so that

$T = t^{i_1 i_2 \dots i_{q-2}}{}_{i_{q-1} i_q}\, \varepsilon_{i_1 i_2 \dots i_{q-2}}{}^{i_{q-1} i_q}$. (34.2)

The indices $i_1, i_2, \dots, i_{q-2}$ are said to be contravariant and the indices
$i_{q-1}, i_q$ are said to be covariant. Mere inspection of the components of a
tensor is therefore sufficient to determine how many times it is
contravariant and covariant. Using (34.1) and (34.2) we deduce

$T = \sum_{i_2, \dots, i_q} \left( \sum_{i_1} t^{i_1 i_2 \dots i_{q-2}}{}_{i_{q-1} i_q}\, e_{i_1} \right) \otimes e_{i_2} \otimes \dots \otimes e_{i_{q-2}} \otimes x^{i_{q-1}} \otimes x^{i_q}$,

where, for clarity, the summation signs have been inserted.

† The term rank is often used in this context. (T.)

Each term of the summation on the right-hand side is the tensor
product of q contravariant or covariant vectors and this sum has $n^{q-1}$
terms.

Thus every affine tensor of order q is the sum of at most $n^{q-1}$ tensor
products of q contravariant or covariant vectors.

For example, if the tensor T, an element of $E_n^{(q)}$, is the sum of
$p \le n^{q-1}$ tensor products of q vectors it can be expressed as the sum
of p terms in the form of the right-hand side of (33.1) and its
components will be the sum of p terms in the form

$x_{(1)}^{i_1} x_{(2)}^{i_2} \dots x_{(q)}^{i_q}$.

It is clear that, whatever the type of tensor, all its components have
the same form.


35. Change of basis for the components of an affine tensor. Let us again
consider a tensor T which is an element of the space $E_n^{(q-2)} \otimes E_n^{*(2)}$.
If the space $E_n$ is referred to the basis $(e_i)$, $E_n^*$ will be referred to the
dual basis $(x^i)$ and $E_n^{(q-2)} \otimes E_n^{*(2)}$ to the basis

$e_{i_1} \otimes \dots \otimes e_{i_{q-2}} \otimes x^{i_{q-1}} \otimes x^{i_q}$. (35.1)

Now refer $E_n$ to another basis $(e_{j'})$ defined by

$e_i = A_i^{j'} e_{j'}$,    $e_{j'} = A_{j'}^{i} e_i$. (35.2)

The space $E_n^{(q-2)} \otimes E_n^{*(2)}$ will then be referred to the associated basis

$e_{j'_1} \otimes e_{j'_2} \otimes \dots \otimes e_{j'_{q-2}} \otimes x^{j'_{q-1}} \otimes x^{j'_q}$. (35.3)

We wish to determine the components $t^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q}$ of T with respect
to the first basis in terms of the components $t^{j'_1 \dots j'_{q-2}}{}_{j'_{q-1} j'_q}$ referred
to the second.

To this end suppose that T is the tensor product of q contravariant
or covariant vectors. Then

$t^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q} = x_{(1)}^{i_1} x_{(2)}^{i_2} \dots x_{(q-2)}^{i_{q-2}}\, x_{(q-1)\, i_{q-1}}\, x_{(q)\, i_q}$, (35.4)

$t^{j'_1 \dots j'_{q-2}}{}_{j'_{q-1} j'_q} = x_{(1)}^{j'_1} x_{(2)}^{j'_2} \dots x_{(q-2)}^{j'_{q-2}}\, x_{(q-1)\, j'_{q-1}}\, x_{(q)\, j'_q}$. (35.5)

However, using the transformation formulae for the components of
a contravariant or covariant vector, we have

$x_{(1)}^{i_1} = A_{j'_1}^{i_1} x_{(1)}^{j'_1}$, etc.;    $x_{(q-1)\, i_{q-1}} = A_{i_{q-1}}^{j'_{q-1}} x_{(q-1)\, j'_{q-1}}$, etc.

Hence

$t^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q} = A_{j'_1}^{i_1} \dots A_{j'_{q-2}}^{i_{q-2}} A_{i_{q-1}}^{j'_{q-1}} A_{i_q}^{j'_q}\, t^{j'_1 \dots j'_{q-2}}{}_{j'_{q-1} j'_q}$, (35.6)

or, interchanging the roles of the two bases we obtain

$t^{j'_1 \dots j'_{q-2}}{}_{j'_{q-1} j'_q} = A_{i_1}^{j'_1} \dots A_{i_{q-2}}^{j'_{q-2}} A_{j'_{q-1}}^{i_{q-1}} A_{j'_q}^{i_q}\, t^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q}$. (35.7)

As each tensor can be regarded as the sum of p tensor products, and
the relations (35.6) and (35.7) are linear and hold for each term of
that sum, these relations also hold for the components of an arbitrary
affine tensor which is (q-2)-fold contravariant and two-fold covariant.

Using (35.6) and (35.7) the general transformation rule for the
components of any tensor may easily be derived. It is clear that a repetition
of the above reasoning will, in particular, give the following
expressions for the components of a purely contravariant tensor:

$t^{i_1 \dots i_q} = A_{j'_1}^{i_1} \dots A_{j'_q}^{i_q}\, t^{j'_1 \dots j'_q}$, (35.8)
$t^{j'_1 \dots j'_q} = A_{i_1}^{j'_1} \dots A_{i_q}^{j'_q}\, t^{i_1 \dots i_q}$. (35.9)

The components of a covariant tensor transform according to

$t_{i_1 \dots i_q} = A_{i_1}^{j'_1} \dots A_{i_q}^{j'_q}\, t_{j'_1 \dots j'_q}$, (35.10)
$t_{j'_1 \dots j'_q} = A_{j'_1}^{i_1} \dots A_{j'_q}^{i_q}\, t_{i_1 \dots i_q}$. (35.11)

These transformation formulae can be interpreted as follows. Let
$E_n$ and $E_n^{(q)}$ be referred to associated bases $(e_i)$ and $(e_{i_1} \otimes \dots \otimes e_{i_q})$ and
consider a system of $n^q$ quantities $t^{i_1 \dots i_q}$ which transform according
to (35.8) and (35.9) on passing from the bases $(e_i)$, $(e_{i_1} \otimes e_{i_2} \otimes \dots \otimes e_{i_q})$
to $(e_{j'})$, $(e_{j'_1} \otimes \dots \otimes e_{j'_q})$. A correspondence can be set up between
these quantities and a tensor T which has them as components with
respect to the basis $(e_{i_1} \otimes \dots \otimes e_{i_q})$. The transformation rule then
defines the same tensor T with respect to an arbitrary basis

$(e_{j'_1} \otimes \dots \otimes e_{j'_q})$.

THEOREM: In order that the $n^q$ quantities $t^{i_1 \dots i_q}$ associated with a
basis $(e_{i_1} \otimes \dots \otimes e_{i_q})$ of the space $E_n^{(q)}$ can be considered as the
components of some contravariant tensor it is necessary and sufficient that
the system transforms according to (35.8) and (35.9) under a change of
basis.

Similar statements hold for the components of a mixed tensor of
any kind.
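
For a tensor of order 3 that is once contravariant and twice covariant, the rules (35.6) and (35.7) can be written out with explicit sums. The sketch below is an illustration under assumed data: the arrays `A` (the $A_i^{j'}$) and `B` (the $A_{j'}^{i}$) and the three sample vectors are chosen arbitrarily. It checks that transforming the components of a tensor product gives the product of the transformed vector components, which is the argument used in the text.

```python
from fractions import Fraction as F

R = range(2)
A = [[F(1), F(1)],          # A[i][a] = A_i^{a'}
     [F(0), F(2)]]
B = [[F(1), F(-1, 2)],      # B[a][i] = A_{a'}^i, the inverse array
     [F(0), F(1, 2)]]

def to_new(t):
    """(35.7) with q = 3: t'^a_{bc} = A_i^{a'} A_{b'}^j A_{c'}^k t^i_{jk}."""
    return [[[sum(A[i][a] * B[b][j] * B[c][k] * t[i][j][k]
                  for i in R for j in R for k in R)
              for c in R] for b in R] for a in R]

def to_old(t_new):
    """(35.6) with q = 3: t^i_{jk} = A_{a'}^i A_j^{b'} A_k^{c'} t'^a_{bc}."""
    return [[[sum(B[a][i] * A[j][b] * A[k][c] * t_new[a][b][c]
                  for a in R for b in R for c in R)
              for k in R] for j in R] for i in R]

def product(x, u, v):
    """Components x^i u_j v_k of a tensor product of three vectors."""
    return [[[x[i] * u[j] * v[k] for k in R] for j in R] for i in R]

x = [F(2), F(3)]            # contravariant vector (assumed values)
u = [F(5), F(14)]           # covariant vectors (assumed values)
v = [F(1), F(-1)]

x_new = [sum(A[i][a] * x[i] for i in R) for a in R]   # contravariant rule
u_new = [sum(B[b][j] * u[j] for j in R) for b in R]   # covariant rule
v_new = [sum(B[c][k] * v[k] for k in R) for c in R]

t = product(x, u, v)
t_new = to_new(t)
t_back = to_old(t_new)
```

Each contravariant index is transformed with the $A_i^{j'}$ and each covariant index with the $A_{j'}^{i}$, one factor per index.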


36. A criterion of tensor character. It is easy to deduce from the
preceding results a criterion of tensor character which is of great practical
use. By way of a change we shall consider the case of a purely covariant
tensor.

THEOREM: A necessary and sufficient condition for a system of $n^q$
quantities $t_{i_1 \dots i_q}$ referred to a basis $(x^{i_1} \otimes \dots \otimes x^{i_q})$ of the space $E_n^{*(q)}$ to
be the components of a covariant tensor is that for any contravariant
vectors $[x_{(1)}, x_{(2)}, \dots, x_{(q)}]$ with components $x_{(k)}^{i}$ the quantity

$t_{i_1 i_2 \dots i_q}\, x_{(1)}^{i_1} x_{(2)}^{i_2} \dots x_{(q)}^{i_q}$

shall remain invariant with respect to all changes of basis.
We establish first that the condition is necessary. If the $t_{i_1 \dots i_q}$ are
the components of a covariant tensor they transform according to

$t_{i_1 i_2 \dots i_q} = A_{i_1}^{j'_1} \dots A_{i_q}^{j'_q}\, t_{j'_1 \dots j'_q}$

and the components of the q given contravariant vectors transform
according to

$x_{(1)}^{i_1} = A_{k'_1}^{i_1} x_{(1)}^{k'_1}$;  ...;  $x_{(q)}^{i_q} = A_{k'_q}^{i_q} x_{(q)}^{k'_q}$.

Hence we deduce

$t_{i_1 i_2 \dots i_q}\, x_{(1)}^{i_1} \dots x_{(q)}^{i_q} = A_{i_1}^{j'_1} A_{k'_1}^{i_1} \dots A_{i_q}^{j'_q} A_{k'_q}^{i_q}\, t_{j'_1 \dots j'_q}\, x_{(1)}^{k'_1} \dots x_{(q)}^{k'_q}$.

Since $A_i^{j'} A_{k'}^{i}$ represents the components of the $e_{k'}$ referred to the $e_{j'}$
we have

$A_i^{j'} A_{k'}^{i} = \delta_{k'}^{j'}$, where $\delta_{k'}^{j'} = 0$ if $j' \neq k'$ and $\delta_{k'}^{j'} = 1$ if $j' = k'$.

It follows that

$t_{i_1 \dots i_q}\, x_{(1)}^{i_1} \dots x_{(q)}^{i_q} = t_{j'_1 \dots j'_q}\, x_{(1)}^{j'_1} \dots x_{(q)}^{j'_q}$. (36.2)

Conversely, suppose the equality (36.2) to be satisfied for the
arbitrary set of contravariant vectors $[x_{(1)}, \dots, x_{(q)}]$. Then the
components $x_{(k)}^{i}$ of these vectors transform according to the formulae

$x_{(1)}^{i_1} = A_{j'_1}^{i_1} x_{(1)}^{j'_1}$;  ...;  $x_{(q)}^{i_q} = A_{j'_q}^{i_q} x_{(q)}^{j'_q}$.

We therefore have the relation

$t_{j'_1 \dots j'_q}\, x_{(1)}^{j'_1} \dots x_{(q)}^{j'_q} = A_{j'_1}^{i_1} \dots A_{j'_q}^{i_q}\, t_{i_1 \dots i_q}\, x_{(1)}^{j'_1} \dots x_{(q)}^{j'_q}$.

Since the vectors are arbitrary, it follows that the $t_{i_1 \dots i_q}$ transform
according to (35.10). This completes the required proof.

The above theorem can immediately be generalized as follows:

THEOREM: In order that a system of $n^{p+q}$ quantities $t_{i_1 \dots i_{p+q}}$ referred
to the basis $(x^{i_1} \otimes \dots \otimes x^{i_p} \otimes \dots \otimes x^{i_{p+q}})$ of the space $E_n^{*(p+q)}$ can be
identified with the components of a covariant tensor, it is necessary and
sufficient that, for p arbitrary contravariant vectors $[x_{(1)}, x_{(2)}, \dots, x_{(p)}]$
with components $x_{(k)}^{i}$, the quantities

$t_{i_1 \dots i_p i_{p+1} \dots i_{p+q}}\, x_{(1)}^{i_1} \dots x_{(p)}^{i_p}$

shall be the components of a covariant tensor of order q.

Let $(y_{(1)}, y_{(2)}, \dots, y_{(q)})$ be q arbitrary contravariant vectors with
components $y_{(k)}^{i}$. In order that the quantities $t_{i_1 \dots i_p i_{p+1} \dots i_{p+q}}$ shall be
the components of a covariant tensor of order p + q it is necessary and
sufficient that the quantity

$t_{i_1 \dots i_p i_{p+1} \dots i_{p+q}}\, x_{(1)}^{i_1} \dots x_{(p)}^{i_p}\, y_{(1)}^{i_{p+1}} \dots y_{(q)}^{i_{p+q}}$

shall remain invariant with respect to changes of basis. But this
condition is also the necessary and sufficient condition that the quantities

$t_{i_1 \dots i_p i_{p+1} \dots i_{p+q}}\, x_{(1)}^{i_1} \dots x_{(p)}^{i_p}$

shall be the components of a covariant tensor of order q.
Analogous statements can be made for the components of an affine
tensor of any kind.
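
The criterion can be seen at work on two systems of $n^2$ quantities. The following sketch is illustrative only; the arrays and sample vectors are assumptions. The first system, `g`, is transformed by the covariant rule (35.11) and gives an invariant. The second, `delta`, is taken to be the same array of numbers in every basis; the corresponding quantity changes under the change of basis, so by the theorem these numbers are not the components of a covariant tensor.

```python
from fractions import Fraction as F

R = range(2)
A = [[F(1), F(1)],          # A[i][a] = A_i^{a'}
     [F(0), F(2)]]
B = [[F(1), F(-1, 2)],      # B[a][i] = A_{a'}^i
     [F(0), F(1, 2)]]

def vector_to_new(x):
    """Contravariant rule: x^{a'} = A_i^{a'} x^i."""
    return [sum(A[i][a] * x[i] for i in R) for a in R]

def covariant2_to_new(t):
    """(35.11) with q = 2: t_{a'b'} = A_{a'}^i A_{b'}^j t_ij."""
    return [[sum(B[a][i] * B[b][j] * t[i][j] for i in R for j in R)
             for b in R] for a in R]

def form(t, x, y):
    """The quantity t_ij x^i y^j of the criterion."""
    return sum(t[i][j] * x[i] * y[j] for i in R for j in R)

g = [[F(1), F(1)], [F(1), F(4)]]        # transformed as a covariant tensor
delta = [[F(1), F(0)], [F(0), F(1)]]    # same numbers in every basis

x, y = [F(2), F(3)], [F(1), F(0)]
x_new, y_new = vector_to_new(x), vector_to_new(y)

tensor_before = form(g, x, y)
tensor_after = form(covariant2_to_new(g), x_new, y_new)
fixed_before = form(delta, x, y)
fixed_after = form(delta, x_new, y_new)
```

The first pair of values agree, the second pair do not.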


37. Affine tensor algebra. We have already met, incidentally, some
algebraic operations which permit the formation of new tensors
from known ones. Let us review these briefly:

(a) Tensor addition. Given two tensors of the same order and of
the same kind which are both elements, for example, of $E_n^{(q-2)} \otimes E_n^{*(2)}$,
the procedure of vector addition generates a third tensor of order q
and of the same kind which is called their sum. If the two tensors
under consideration have the components $t^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q}$ and
$u^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q}$ respectively their sum obviously has the components

$s^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q} = t^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q} + u^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q}$.

(b) Tensor multiplication. Given two tensors of order q and q'
which are of any kind, their tensor product generates a tensor of order
q + q'. If, for example, two tensors have the components $t^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q}$
and $u_{j_1}{}^{j_2 \dots j_{q'}}$, then their tensor product has components

$p^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q j_1}{}^{j_2 \dots j_{q'}} = t^{i_1 \dots i_{q-2}}{}_{i_{q-1} i_q}\, u_{j_1}{}^{j_2 \dots j_{q'}}$.

Consider any tensor which is an element, for example, of $E_n^{(q)}$. The
multiplication in $E_n^{(q)}$ of this tensor by a scalar is a particular case of
tensor multiplication if the scalar (or invariant) is regarded as a tensor
of zero order. We shall adopt this point of view in the future.


38. Contraction of indices. Besides the two fundamental operations
given above there exists a third, the contraction of indices, which
permits the derivation of new tensors of order $(q-2)$ from a given
mixed tensor of order q.

Consider first a mixed tensor of order 2 having the components $t_i{}^j$.
We now show that the quantity $t_i{}^i$, obtained by summing the com-
ponents with identical contravariant and covariant indices, is
invariant with respect to a change in basis.

In fact, on changing the basis we get

$$ t_{i'}{}^{i'} = A_{i'}^{j} A_{k}^{i'}\, t_j{}^k. $$

But, from (35.1),

$$ A_{i'}^{j} A_{k}^{i'} = \delta_k^j = \begin{cases} 0 & \text{if } k \neq j, \\ 1 & \text{if } k = j, \end{cases} $$

which gives

$$ t_{i'}{}^{i'} = t_j{}^j. $$


Consider next a mixed tensor of order q and choose a pair of in-
dices, one covariant and one contravariant. In order to simplify the
notation let these be the first two indices of the tensor

$$ t_{i_1}{}^{i_2}{}_{i_3 \dots i_q}. $$

Let $i_1$ and $i_2$ be put equal to i and sum over this repeated index. We
now show that the quantities

$$ t_{i}{}^{i}{}_{i_3 \dots i_q} $$

thus obtained are the components of a tensor of order $(q-2)$. Let
$[\mathbf{x}_{(3)}, \dots, \mathbf{x}_{(q)}]$ be $(q-2)$ arbitrary contravariant vectors. In virtue of the
results of §36 the quantities

$$ t_{i_1}{}^{i_2}{}_{i_3 \dots i_q}\; x_{(3)}^{i_3} \cdots x_{(q)}^{i_q} $$

are the components of a mixed second-order tensor. It follows that
for any vectors $[\mathbf{x}_{(3)}, \dots, \mathbf{x}_{(q)}]$ the quantity

$$ t_{i}{}^{i}{}_{i_3 \dots i_q}\; x_{(3)}^{i_3} \cdots x_{(q)}^{i_q} $$

is invariant with respect to changes of basis, which demonstrates the
tensor character of $t_{i}{}^{i}{}_{i_3 \dots i_q}$.

The operation which consists in equating two indices, one contra-
variant and the other covariant, and summing over their common
values is called contraction. The contraction of two indices in a tensor
of order q generates a tensor of order $(q-2)$.

It is clear that if the tensor under consideration has more than one
pair of indices, one covariant and the other contravariant, the opera-
tion of contraction can be repeated.
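
The invariance of the contracted quantity can be checked numerically. The following is a minimal sketch, assuming a two-dimensional space and arbitrarily chosen illustrative values; the names `P`, `Q`, `mat_mul`, `inverse_2x2` and `contract` are not from the text. `P[i][j]` stands for $A_{i'}^{j}$ and its inverse `Q[k][j]` for $A_{k}^{j'}$, so that the transformed mixed components are the matrix product `P t Q`.

```python
from fractions import Fraction as F

def mat_mul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def inverse_2x2(m):
    """Exact inverse of a 2x2 matrix with non-zero determinant."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def contract(t):
    """Sum the components whose covariant and contravariant indices agree."""
    return sum(t[i][i] for i in range(len(t)))

# t[j][k] stands for t_j^k in the old basis (illustrative values only).
t = [[F(2), F(-1)], [F(5), F(3)]]
# P[i][j] stands for A_{i'}^j; Q is its inverse, Q[k][j] standing for A_k^{j'}.
P = [[F(1), F(2)], [F(3), F(7)]]
Q = inverse_2x2(P)

# t_{i'}^{j'} = A_{i'}^j A_k^{j'} t_j^k, written as the matrix product P t Q.
t_new = mat_mul(mat_mul(P, t), Q)

trace_old = contract(t)
trace_new = contract(t_new)
print(trace_old, trace_new)  # both contractions equal 5
```

The individual components change under the change of basis, but the contracted sum does not.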


39. Contracted multiplication. A general criterion of tensor character.
One frequently encounters tensor products where the contracted
indices belong to different factors of the product. We shall call this
contracted multiplication. The operation of contraction may, more-
over, be repeated many times in such circumstances.

If $t_{ijkl}$ and $u^{mnp}$ are the components of two tensors their tensor pro-
duct has the components

$$ p_{ijkl}{}^{mnp} = t_{ijkl}\, u^{mnp} $$

and the tensor

$$ p_{ijkl}{}^{klp} = t_{ijkl}\, u^{klp} $$

is one of their contracted products, obtained by contracting the
indices m, k and n, l.

The concept of contracted multiplication can be used to specify a
criterion of tensor character which generalizes that of §36. This can
best be expressed in terms of a particular example:

In order that the quantities $t^{ijkl}$ referred to a basis $(\mathbf{e}_i \otimes \mathbf{e}_j \otimes \mathbf{e}_k \otimes \mathbf{e}_l)$
shall be the components of a contravariant tensor it is necessary and
sufficient that, for any covariant tensor $s_{kl}$, the quantities $t^{ijkl} s_{kl}$ shall be
the components of a contravariant tensor.

The condition is evidently necessary from the study of contracted
multiplication which we have just carried out. To show that it is
sufficient we only have to observe that it is possible for $s_{kl}$ to be the
tensor product of two covariant vectors with components $x_k$ and $y_l$,
and to use the second theorem of §36.


40. Symmetric and antisymmetric tensors. A contravariant second-
order tensor, whose components referred to a given basis are $t^{ij}$, is
said to be symmetric in its two indices if

$$ t^{ij} = t^{ji} $$

and antisymmetric if

$$ t^{ij} = -t^{ji}. $$

Suppose, for example, that it is symmetric and change the basis:

$$ t^{k'l'} = A_{i}^{k'} A_{j}^{l'}\, t^{ij} = A_{i}^{k'} A_{j}^{l'}\, t^{ji} = t^{l'k'}. $$

Hence the fact that a tensor is symmetric (or antisymmetric) is a
property of the tensor itself and not merely of its components with
respect to a particular basis.

The same considerations hold for a covariant tensor of second
order. They extend immediately to pairs of indices, both contra-
variant or both covariant, of tensors of order q > 2.


III. EUCLIDEAN TENSORS


41. Euclidean tensors and their components. Let us now take $E_n$ to be a
Euclidean vector space. In this case we shall show that every tensor of
order q can be identified with a contravariant tensor of order q, so
there is no point in considering contravariant, covariant or mixed
tensors of the same order to be distinct entities.

Suppose $E_n$ to be referred to an arbitrary basis and $E_n^{*}$ to the cor-
responding dual basis and designate the coefficients of the fundamental
quadratic form by $g_{ij}$. We have seen in §24 that we may identify the
element in $E_n$ which has the components $x^i$ with the element in $E_n^{*}$
which has the components $x_i$ since

$$ x_i = g_{ij}\, x^j, \qquad x^i = g^{ij}\, x_j. $$

Having established this, let us consider a contravariant tensor T of
order q which is the tensor product of q vectors $\mathbf{x}_{(1)}, \mathbf{x}_{(2)}, \dots, \mathbf{x}_{(q)}$:

$$ \mathbf{T} = \mathbf{x}_{(1)} \otimes \mathbf{x}_{(2)} \otimes \dots \otimes \mathbf{x}_{(q)}. $$

To each of these vectors there corresponds a particular element in
$E_n^{*}$. Consider the various affine tensors obtained by replacing one or
more of the vectors $\mathbf{x}_{(1)}, \mathbf{x}_{(2)}, \dots, \mathbf{x}_{(q)}$ by their corresponding elements
in $E_n^{*}$. These affine tensors are said to define the same Euclidean tensor
whose contravariant, covariant or mixed components are the compo-
nents of the corresponding affine contravariant, covariant or mixed
tensors.

The relations between the various types of component of the
Euclidean tensor T may readily be determined. Denoting the contra-
variant components of the vector $\mathbf{x}_{(\alpha)}$ by $x_{(\alpha)}^{i}$, the contravariant com-
ponents of T are

$$ t^{i_1 i_2 \dots i_q} = x_{(1)}^{i_1}\, x_{(2)}^{i_2} \cdots x_{(q)}^{i_q}. $$

If, for example, we replace the vector $\mathbf{x}_{(2)}$ by the corresponding
element of $E_n^{*}$ we obtain the mixed components, one-fold covariant:

$$ t^{i_1}{}_{j_2}{}^{i_3 \dots i_q} = x_{(1)}^{i_1}\, x_{(2) j_2}\, x_{(3)}^{i_3} \cdots x_{(q)}^{i_q}, $$

where $x_{(2) j_2} = g_{i_2 j_2}\, x_{(2)}^{i_2}$.

From this we deduce that

$$ t^{i_1}{}_{j_2}{}^{i_3 \dots i_q} = g_{i_2 j_2}\, t^{i_1 i_2 i_3 \dots i_q}. \tag{41.1} $$

Conversely, we have

$$ t^{i_1 i_2 i_3 \dots i_q} = g^{i_2 j_2}\, t^{i_1}{}_{j_2}{}^{i_3 \dots i_q}. \tag{41.2} $$

If this operation is repeated using the index $i_3$ we obtain the two-
fold covariant components

$$ t^{i_1}{}_{j_2 j_3}{}^{i_4 \dots i_q} = g_{i_3 j_3}\, t^{i_1}{}_{j_2}{}^{i_3 i_4 \dots i_q} = g_{i_2 j_2}\, g_{i_3 j_3}\, t^{i_1 i_2 i_3 \dots i_q}. \tag{41.3} $$

Operating on all the indices we finally obtain for the covariant com-
ponents

$$ t_{j_1 j_2 \dots j_q} = g_{i_1 j_1}\, g_{i_2 j_2} \cdots g_{i_q j_q}\, t^{i_1 i_2 \dots i_q}. \tag{41.4} $$

Conversely, the contravariant components may be expressed in
terms of the covariant components as

$$ t^{i_1 i_2 \dots i_q} = g^{i_1 j_1}\, g^{i_2 j_2} \cdots g^{i_q j_q}\, t_{j_1 j_2 \dots j_q}. \tag{41.5} $$

It is now seen that, on multiplying by $g_{ij}$ or $g^{ij}$ and summing, each of
the q indices of the tensor T may be placed either in the contravariant
or in the covariant position.

Let T now be any contravariant tensor of order q; such a tensor
can be expressed as a sum of p tensor products of q vectors. Each of
these tensor products defines a Euclidean tensor. Consider the various
affine tensors of the same type associated with these Euclidean tensors.
In summing them one sets up a correspondence between the tensor T
and the affine tensors whose components are given by formulae such
as (41.1), (41.3), (41.4) by virtue of the linear character of these
formulae in the components $t^{i_1 i_2 \dots i_q}$ of T. It follows that these affine
tensors do not depend on the way in which T has been decomposed
into a sum of tensor products and they are consequently uniquely
defined by T. We expect these various affine tensors to define a unique
Euclidean tensor whose various components are the components of
identical affine tensors. This can be expressed formally as follows:

THEOREM: The various contravariant, covariant or mixed com-
ponents of a Euclidean tensor can be derived from each other by
multiplying by $g_{ij}$ or $g^{ij}$ and summing, this operation being performed
one or more times.

The criteria of tensor character which have already been given for
affine tensors obviously apply without modification to Euclidean
tensors; the covariant components (for example) of a Euclidean
tensor being the same as the components of the corresponding
covariant affine tensor.
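
The passage between the different kinds of component can be illustrated for a second-order tensor. This is a minimal sketch, assuming a two-dimensional space with an arbitrarily chosen symmetric, non-singular set of coefficients `g`; the function names are illustrative and not taken from the text. It lowers one index as in (41.1), lowers both as in (41.4), and recovers the contravariant components as in (41.5).

```python
from fractions import Fraction as F

def inverse_2x2(m):
    """Exact inverse of a 2x2 matrix with non-zero determinant."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

n = 2
# Assumed coefficients g_ij of the fundamental form in a non-orthonormal basis.
g = [[F(2), F(1)], [F(1), F(3)]]
g_inv = inverse_2x2(g)  # the quantities g^{ij}

# Contravariant components t^{ij} (illustrative values only).
t_up = [[F(1), F(4)], [F(-2), F(0)]]

def lower_second(t):
    """Mixed components t^i_j = g_{jk} t^{ik}, as in (41.1)."""
    return [[sum(g[j][k] * t[i][k] for k in range(n)) for j in range(n)]
            for i in range(n)]

def lower_all(t):
    """Covariant components t_{ij} = g_{ik} g_{jl} t^{kl}, as in (41.4)."""
    return [[sum(g[i][k] * g[j][l] * t[k][l]
                 for k in range(n) for l in range(n))
             for j in range(n)] for i in range(n)]

def raise_all(t):
    """Contravariant components t^{ij} = g^{ik} g^{jl} t_{kl}, as in (41.5)."""
    return [[sum(g_inv[i][k] * g_inv[j][l] * t[k][l]
                 for k in range(n) for l in range(n))
             for j in range(n)] for i in range(n)]

t_mixed = lower_second(t_up)
t_down = lower_all(t_up)
t_back = raise_all(t_down)
print(t_back == t_up)  # lowering and then raising restores the components
```

Exact rational arithmetic is used so that the round trip can be compared for equality rather than to within a tolerance.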


42. Symmetric and antisymmetric Euclidean tensors. A second-order
Euclidean tensor is said to be symmetric if the associated contra-
variant affine tensor is symmetric. We then have

$$ t^{ij} = t^{ji}. $$

Therefore

$$ t_{kl} = g_{ik}\, g_{jl}\, t^{ij} = g_{ik}\, g_{jl}\, t^{ji} = t_{lk} $$

and the covariant components are also symmetric in their two indices.
Conversely, if the covariant components are symmetric, it is clear
that the contravariant components are also symmetric. Analogous
considerations hold for antisymmetric Euclidean tensors. They may
also be applied to any two indices of a tensor of order q > 2.


43. The fundamental tensor. The scalar product of two vectors x and y
of $E_n$, which have the contravariant components $x^i$ and $y^i$ respectively,
is given by

$$ \mathbf{x} \cdot \mathbf{y} = g_{ij}\, x^i y^j. $$

This expression is invariant under changes of basis for any pair of
vectors x and y. It therefore follows that the $g_{ij}$ are the covariant
components of a Euclidean tensor. We call this the fundamental
tensor of the space $E_n$.

This tensor, being symmetric, has only one set of mixed components

$$ g_i{}^j = g^j{}_i = g^{jk}\, g_{ik}. $$

Let us evaluate these. From the definition of the $g^{jk}$ (21.5) we have

$$ g_i{}^j = \frac{g_{ik}\, \alpha^{jk}}{g}, $$

where $\alpha^{jk}$ denotes the cofactor of $g_{jk}$ in the determinant g. The
numerator of the right-hand side is therefore the same as the expan-
sion of g except that the $g_{jk}$ are replaced by the $g_{ik}$. It follows from the
properties of such expansions† that

$$ g_i{}^j = \begin{cases} 0 & \text{if } i \neq j, \\ 1 & \text{if } i = j. \end{cases} \tag{43.1} $$

The contravariant components of the fundamental tensor are
therefore given by

$$ g^{ik}\, g_k{}^j = g^{ij}. $$

These components are, of course, identical with the quantities $g^{ij}$
introduced in §21. We can now state:

THEOREM: The quantities $g_{ij}$ and $g^{ij}$ are respectively the covariant
and contravariant components of a certain symmetric tensor which is
the fundamental tensor of Euclidean space. The quantities defined by
(43.1) are the mixed components of this tensor.


44. Euclidean tensor algebra. We have seen that the various com-
ponents of a Euclidean tensor may be expressed as linear forms in the
contravariant components (for example) of this tensor. It follows that
the algebraic operations given for affine tensors may be extended to
Euclidean tensors.

(a) Addition. Tensor addition (considered in §37) gives a corre-
spondence between two Euclidean tensors of the same order q and a
third Euclidean tensor of order q called their sum. If the two tensors
have contravariant components $t^{i_1 i_2 \dots i_q}$ and $u^{i_1 i_2 \dots i_q}$, then the contra-
variant components of their sum are given by

$$ s^{i_1 i_2 \dots i_q} = t^{i_1 i_2 \dots i_q} + u^{i_1 i_2 \dots i_q}. $$

Repeated contracted multiplication by the fundamental tensor gives a
similar relation for the covariant components:

$$ s_{i_1 i_2 \dots i_q} = t_{i_1 i_2 \dots i_q} + u_{i_1 i_2 \dots i_q}. $$

An analogous result holds for the various components with mixed
indices.


† When i ≠ j the expansion is that of a determinant with a repeated
column, and therefore vanishes. When i = j the expansion is of g.
(b) Multiplication. Tensor multiplication generates a Euclidean
tensor of order q + q' from two Euclidean tensors of orders q and q'.
If the tensors under consideration have contravariant components
$t^{i_1 \dots i_q}$ and $u^{i_{q+1} \dots i_{q+q'}}$, the contravariant components of their product
are given by

$$ p^{i_1 \dots i_q i_{q+1} \dots i_{q+q'}} = t^{i_1 \dots i_q}\; u^{i_{q+1} \dots i_{q+q'}}. $$

This equality obviously remains valid if one or more of the similarly
positioned indices of the two sides is placed in the covariant position.

(c) Contraction. Consider a Euclidean tensor with contravariant
components $t^{i_1 i_2 \dots i_q}$ and choose two arbitrary indices, for example
the indices $i_1$ and $i_2$. When the first of these is moved to the covariant
position the corresponding components are

$$ t_{j_1}{}^{i_2 \dots i_q} = g_{i_1 j_1}\, t^{i_1 i_2 \dots i_q}. $$

By contracting the indices $j_1$ and $i_2$ it follows that

$$ t_{i_2}{}^{i_2 i_3 \dots i_q} = g_{i_1 i_2}\, t^{i_1 i_2 i_3 \dots i_q} = t^{i_1}{}_{i_1}{}^{i_3 \dots i_q}, \tag{44.1} $$

using the symmetry of the $g_{ij}$. The quantities (44.1) are the contra-
variant components of a Euclidean tensor of order $(q-2)$:

$$ v^{i_3 \dots i_q} = t_{j}{}^{j\, i_3 \dots i_q}. \tag{44.2} $$

It follows from (44.1) that this tensor does not depend upon which of
the indices $i_1$ and $i_2$ is lowered. (44.2) remains valid if one or more of
the indices $i_3 \dots i_q$ is placed in the covariant position.


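45. The space $E_n^{(q)}$ as a Euclidean space. A Euclidean space may
readily be constructed from $E_n^{(q)}$, which is the qth tensorial power of
the Euclidean space $E_n$.

We refer the space $E_n$ to an arbitrary basis $(\mathbf{e}_i)$ and $E_n^{(q)}$ to the
associated basis $(\mathbf{e}_{i_1} \otimes \mathbf{e}_{i_2} \otimes \dots \otimes \mathbf{e}_{i_q})$. Any two qth order tensors, T
with components $t^{i_1 i_2 \dots i_q}$ and U with components $u^{i_1 i_2 \dots i_q}$, define a
scalar

$$ \mathbf{T} \cdot \mathbf{U} = t^{i_1 i_2 \dots i_q}\, u_{i_1 i_2 \dots i_q} \tag{45.1} $$

such that

$$ \mathbf{T} \cdot \mathbf{U} = g_{i_1 j_1}\, g_{i_2 j_2} \cdots g_{i_q j_q}\, t^{i_1 i_2 \dots i_q}\, u^{j_1 j_2 \dots j_q} = t_{j_1 j_2 \dots j_q}\, u^{j_1 j_2 \dots j_q}. \tag{45.2} $$

It is clear that the rule of composition thus defined has the pro-
perties detailed in §15 which characterize the scalar product of two
vectors in a vector space. We say that (45.1) is the scalar product of
two tensors T and U. This scalar product defines a Euclidean space
constructed on $E_n^{(q)}$.

In particular the scalar product of two elements $(\mathbf{e}_{i_1} \otimes \mathbf{e}_{i_2} \otimes \dots \otimes \mathbf{e}_{i_q})$
and $(\mathbf{e}_{j_1} \otimes \mathbf{e}_{j_2} \otimes \dots \otimes \mathbf{e}_{j_q})$ of the basis of $E_n^{(q)}$ is, according to (45.2), given
by

$$ (\mathbf{e}_{i_1} \otimes \mathbf{e}_{i_2} \otimes \dots \otimes \mathbf{e}_{i_q}) \cdot (\mathbf{e}_{j_1} \otimes \mathbf{e}_{j_2} \otimes \dots \otimes \mathbf{e}_{j_q}) = g_{i_1 j_1}\, g_{i_2 j_2} \cdots g_{i_q j_q} \tag{45.3} $$

so that the fundamental quadratic form of the Euclidean space $E_n^{(q)}$
has the coefficients $g_{i_1 j_1}\, g_{i_2 j_2} \cdots g_{i_q j_q}$.

If the basis $(\mathbf{e}_i)$ of $E_n$ is orthonormal then

$$ (\mathbf{e}_{i_1} \otimes \mathbf{e}_{i_2} \otimes \dots \otimes \mathbf{e}_{i_q}) \cdot (\mathbf{e}_{j_1} \otimes \mathbf{e}_{j_2} \otimes \dots \otimes \mathbf{e}_{j_q}) = \delta_{i_1 j_1}\, \delta_{i_2 j_2} \cdots \delta_{i_q j_q}, $$

and the right-hand side is zero if the set $(i_1, i_2, \dots, i_q)$ differs from the
set $(j_1, j_2, \dots, j_q)$ and is equal to unity otherwise. Hence the basis
$(\mathbf{e}_{i_1} \otimes \mathbf{e}_{i_2} \otimes \dots \otimes \mathbf{e}_{i_q})$ of $E_n^{(q)}$, derived from the tensor product of an
orthonormal basis of $E_n$, is itself orthonormal.

The covariant components of the tensor T, considered as a vector
of the Euclidean space $E_n^{(q)}$, are clearly identical with the covariant
components of T considered as a tensor in $E_n$.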


IV. OUTER PRODUCTS 


46. Antisymmetric tensors of order 2. Let us return to the case where
$E_n$ is a general vector space which we refer to an arbitrary basis $(\mathbf{e}_i)$. In
the space $E_n^{(2)}$ referred to the associated basis $(\mathbf{e}_i \otimes \mathbf{e}_j)$ consider the
antisymmetric tensors

$$ \mathbf{T} = t^{ij}\, \mathbf{e}_i \otimes \mathbf{e}_j = \sum_{i<j} t^{ij}\, \mathbf{e}_i \otimes \mathbf{e}_j + \sum_{i>j} t^{ij}\, \mathbf{e}_i \otimes \mathbf{e}_j \tag{46.1} $$

where $t^{ij} = -t^{ji}$, $t^{ii} = 0$.

Interchanging the indices in the second sum on the right-hand side and
taking the antisymmetry into account it follows that

$$ \mathbf{T} = \sum_{i<j} t^{ij}\, (\mathbf{e}_i \otimes \mathbf{e}_j - \mathbf{e}_j \otimes \mathbf{e}_i). \tag{46.2} $$

Any antisymmetric tensor of $E_n^{(2)}$ can therefore be expressed as a
linear combination of the ${}^nC_2$ elements

$$ (\mathbf{e}_i \otimes \mathbf{e}_j - \mathbf{e}_j \otimes \mathbf{e}_i), \qquad (i < j) \tag{46.3} $$

and these ${}^nC_2$ elements obviously form a linearly independent system,
otherwise the $(\mathbf{e}_i \otimes \mathbf{e}_j)$ could not be linearly independent. This can be
formulated as follows.

THEOREM: The antisymmetric tensors in $E_n^{(2)}$ generate a vector
sub-space $A_n^{(2)}$ of $E_n^{(2)}$ having ${}^nC_2$ dimensions. This sub-space has the
elements of order 2 defined by (46.3) as a basis.


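47. Outer product of two vectors. DEFINITION: Given two vectors x
and y of $E_n$, the antisymmetric tensor

$$ \mathbf{x} \wedge \mathbf{y} = \mathbf{x} \otimes \mathbf{y} - \mathbf{y} \otimes \mathbf{x} \tag{47.1} $$

is called the outer product of the two vectors.

If $x^i$ and $y^i$ denote the components of x and y in $(\mathbf{e}_i)$, $\mathbf{x} \wedge \mathbf{y}$ has the
antisymmetric components

$$ t^{ij} = x^i y^j - x^j y^i \tag{47.2} $$

in $(\mathbf{e}_i \otimes \mathbf{e}_j)$.

The outer product of two vectors has the following properties,
which are also sufficient to define it:

(a) If x, y, z denote vectors of $E_n$ and $\alpha$ is a scalar, the outer product
has the usual distributive properties:

$$ \mathbf{x} \wedge (\mathbf{y} + \mathbf{z}) = \mathbf{x} \wedge \mathbf{y} + \mathbf{x} \wedge \mathbf{z}, $$
$$ (\mathbf{x} + \mathbf{y}) \wedge \mathbf{z} = \mathbf{x} \wedge \mathbf{z} + \mathbf{y} \wedge \mathbf{z}, $$
$$ \alpha \mathbf{x} \wedge \mathbf{y} = \mathbf{x} \wedge \alpha \mathbf{y} = \alpha (\mathbf{x} \wedge \mathbf{y}). $$

(b) It is anticommutative:

$$ \mathbf{x} \wedge \mathbf{y} = -\mathbf{y} \wedge \mathbf{x}, $$

in particular

$$ \mathbf{x} \wedge \mathbf{x} = 0. $$

(c) If $(\mathbf{e}_1, \mathbf{e}_2, \dots, \mathbf{e}_n)$ is a basis of $E_n$, the ${}^nC_2$ elements

$$ \mathbf{e}_i \wedge \mathbf{e}_j \qquad (i < j) \tag{47.3} $$

form a basis for $A_n^{(2)}$.

An outer product of two vectors is called a bivector.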
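
The components (47.2) are easy to tabulate. The sketch below is illustrative only: the function names `outer` and `proper_components` and the sample vectors are assumptions, not part of the text. It forms the full antisymmetric array of a bivector in a three-dimensional space and extracts the ${}^3C_2 = 3$ components with i < j.

```python
from itertools import combinations

def outer(x, y):
    """Components t^{ij} = x^i y^j - x^j y^i of the outer product, as in (47.2)."""
    n = len(x)
    return [[x[i] * y[j] - x[j] * y[i] for j in range(n)] for i in range(n)]

def proper_components(t):
    """Keep only the components with i < j, keyed by 1-based index pairs."""
    n = len(t)
    return {(i + 1, j + 1): t[i][j] for i, j in combinations(range(n), 2)}

x = [1, 2, 3]
y = [4, 5, 6]

t = outer(x, y)
proper = proper_components(t)
print(proper)  # {(1, 2): -3, (1, 3): -6, (2, 3): -3}
```

Swapping the arguments of `outer` changes the sign of every component, and `outer(x, x)` vanishes identically, in agreement with property (b).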
48. Proper components of a bivector. Change of basis. According to
(46.2) the antisymmetric tensor T may be written

$$ \mathbf{T} = \sum_{i<j} t^{ij}\, \mathbf{e}_i \wedge \mathbf{e}_j. $$

Considered as an element of the space $A_n^{(2)}$ and referred to the basis
(47.3), this tensor has the ${}^nC_2$ components $t^{ij}$ (i < j). These compo-
nents are called the proper components of T. In order to distinguish
them from the $n^2$ components of T considered as an element of $E_n^{(2)}$
they are represented by $t^{(ij)}$, where it is always understood that the
index i is less than the index j.

Let us determine the transformation properties of the proper com-
ponents on passing from the basis $(\mathbf{e}_i)$ to another basis $(\mathbf{e}_{i'})$ in $E_n$. The
spaces $E_n^{(2)}$ and $A_n^{(2)}$ will be referred to the bases $(\mathbf{e}_{i'} \otimes \mathbf{e}_{j'})$ and $(\mathbf{e}_{i'} \wedge \mathbf{e}_{j'})$
respectively and, with the usual notation, we have

$$ t^{ij} = A_{k'}^{i} A_{l'}^{j}\, t^{k'l'}. $$

Therefore

$$ t^{(ij)} = \sum_{k'<l'} A_{k'}^{i} A_{l'}^{j}\, t^{k'l'} + \sum_{k'>l'} A_{k'}^{i} A_{l'}^{j}\, t^{k'l'}. $$

Interchanging the indices k', l' in the second term and allowing for
the antisymmetry of the transformed components $t^{k'l'}$ it follows
that

$$ t^{(ij)} = \sum_{k'<l'} \left( A_{k'}^{i} A_{l'}^{j} - A_{l'}^{i} A_{k'}^{j} \right) t^{(k'l')}. $$

Thus the proper components of T transform under a change of basis
in $E_n$ according to

$$ t^{(ij)} = \sum_{k'<l'} \begin{vmatrix} A_{k'}^{i} & A_{l'}^{i} \\ A_{k'}^{j} & A_{l'}^{j} \end{vmatrix}\, t^{(k'l')}. \tag{48.1} $$
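
Rule (48.1) can be compared with the ordinary tensor rule on a small example. The following is a sketch under stated assumptions: the space is three-dimensional, the array `A` (with `A[i][k]` standing for $A_{k'}^{i}$) and the primed proper components are arbitrary illustrative values, and all names are hypothetical.

```python
from itertools import combinations

n = 3
# A[i][k] stands for A_{k'}^i (illustrative values only).
A = [[1, 2, 0], [0, 1, 3], [4, 0, 1]]
# Proper components t^{(k'l')} in the primed basis, keyed by (k, l) with k < l.
proper_new = {(0, 1): 2, (0, 2): -1, (1, 2): 5}

def full_from_proper(proper, size):
    """Rebuild the full antisymmetric array from its proper components."""
    t = [[0] * size for _ in range(size)]
    for (k, l), value in proper.items():
        t[k][l] = value
        t[l][k] = -value
    return t

t_new = full_from_proper(proper_new, n)

# Ordinary rule: t^{ij} = A_{k'}^i A_{l'}^j t^{k'l'}, summed over all k', l'.
t_old = [[sum(A[i][k] * A[j][l] * t_new[k][l]
              for k in range(n) for l in range(n))
          for j in range(n)] for i in range(n)]

# Rule (48.1): sum over k' < l' only, each term weighted by a 2x2 minor of A.
proper_old = {
    (i, j): sum((A[i][k] * A[j][l] - A[i][l] * A[j][k]) * proper_new[(k, l)]
                for k, l in combinations(range(n), 2))
    for i, j in combinations(range(n), 2)
}

agree = all(proper_old[(i, j)] == t_old[i][j] for (i, j) in proper_old)
print(agree)
```

The comparison holds for any choice of `A`, since (48.1) is an algebraic rearrangement of the ordinary rule; only ${}^nC_2$ components need to be stored and transformed.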


49. Outer forms of order 2. We have seen that each vector space $E_n$
can be associated with a dual vector space, $E_n^{*}$, of linear forms defined
on $E_n$. Outer products of order 2 can also be defined in $E_n^{*}$; they are
the affine antisymmetric covariant tensors of order 2. These tensors
define a vector sub-space $A_n^{*(2)}$ of $E_n^{*(2)}$, again having ${}^nC_2$ dimensions.

DEFINITION: An outer form of order 2 is any element of $A_n^{*(2)}$.

If $E_n$ is referred to a basis $(\mathbf{e}_i)$ then we refer $E_n^{*}$ to the dual basis $(x^i)$
and the spaces $E_n^{*(2)}$ and $A_n^{*(2)}$ to the bases $(x^i \otimes x^j)$ and $(x^i \wedge x^j)$. An
element F of $A_n^{*(2)}$ which has the components $f_{ij}$ in $E_n^{*(2)}$ can be written

$$ F = \sum_{i<j} f_{(ij)}\, x^i \wedge x^j, $$

where $f_{(ij)} = f_{ij}$ for i < j.

The ${}^nC_2$ quantities $f_{(ij)}$ are the proper components of the form F.
Under a change of basis in $E_n$ they transform according to the
formulae

$$ f_{(k'l')} = \sum_{i<j} \begin{vmatrix} A_{k'}^{i} & A_{l'}^{i} \\ A_{k'}^{j} & A_{l'}^{j} \end{vmatrix}\, f_{(ij)}. \tag{49.1} $$

The scalar quantity

$$ \sum_{i<j} f_{(ij)}\, t^{(ij)} = \tfrac{1}{2} \sum_{i,j} f_{ij}\, t^{ij} \tag{49.2} $$

is said to be the value of the form F for an element T of $A_n^{(2)}$ having
proper components $t^{(ij)}$.

If the tensor T varies over $A_n^{(2)}$ the quantity (49.2) constitutes a
linear form defined on $A_n^{(2)}$. Conversely, given a linear form defined on
$A_n^{(2)}$, this can always be written

$$ \sum_{i<j} f_{(ij)}\, t^{(ij)}. \tag{49.3} $$

The $t^{(ij)}$ transform under a change of basis according to (48.1) and,
since (49.3) is a scalar, the $f_{(ij)}$ transform according to (49.1) and are
therefore the proper components of an element of $A_n^{*(2)}$. So the affine
antisymmetric covariant tensors of order 2 can be interpreted equally
well as elements of $A_n^{*(2)}$ or elements of the space $[A_n^{(2)}]^{*}$ dual to $A_n^{(2)}$.


50. Completely antisymmetric tensors. In view of their importance
we have restricted ourselves so far to the study of antisymmetric
tensors of order 2. Analogous results can be shown to hold for
completely antisymmetric tensors of order q ≤ n, that is for tensors
which are antisymmetric with respect to every pair of indices. Con-
travariant tensors of this type constitute a vector sub-space of $E_n^{(q)}$ in
${}^nC_q$ dimensions. ${}^nC_q$ proper components are therefore sufficient to
define them.

We begin by indicating the results for q = n. A completely anti-
symmetric tensor of order n has only one proper component $t^{(12 \dots n)}$.
Because of the antisymmetry the ordinary components of this tensor
are given by

$$ t^{i_1 i_2 \dots i_n} = \varepsilon^{i_1 i_2 \dots i_n}\, t^{(12 \dots n)} \tag{50.1} $$

where $\varepsilon^{i_1 i_2 \dots i_n}$ is equal to zero if any two indices are equal, to +1 if the
permutation $(i_1, i_2, \dots, i_n)$ of the set (1, 2, ..., n) is even and to −1 other-
wise.

Under a change of the basis $(\mathbf{e}_i)$ the proper component $t^{(12 \dots n)}$
transforms according to the formula

$$ t^{(12 \dots n)} = \Delta\, t^{(1'2' \dots n')} \tag{50.2} $$

where $\Delta$ is the determinant

$$ \Delta = \begin{vmatrix} A_{1'}^{1} & A_{2'}^{1} & \dots & A_{n'}^{1} \\ A_{1'}^{2} & A_{2'}^{2} & \dots & A_{n'}^{2} \\ \vdots & \vdots & & \vdots \\ A_{1'}^{n} & A_{2'}^{n} & \dots & A_{n'}^{n} \end{vmatrix}. \tag{50.3} $$

51. Outer products in a Euclidean space. Suppose $E_n$ is a Euclidean
space. Then the various tensors defined by outer products of vectors
are Euclidean. The contravariant and covariant components of a
Euclidean tensor of order 2 are related by

$$ t_{ij} = g_{ik}\, g_{jl}\, t^{kl}. $$

If the tensor under consideration is antisymmetric, its contra-
variant and covariant proper components are related by

$$ t_{(ij)} = \sum_{k<l} \begin{vmatrix} g_{ik} & g_{il} \\ g_{jk} & g_{jl} \end{vmatrix}\, t^{(kl)} \tag{51.1} $$

as can readily be established by an argument similar to that given in
§46 and §48. The space $A_n^{(2)}$ has the structure of a Euclidean space,
its fundamental quadratic form having the coefficients $(g_{ik}\, g_{jl} - g_{il}\, g_{jk})$.

For a completely antisymmetric tensor of order n, similar reasoning
shows that

$$ t_{(12 \dots n)} = g\, t^{(12 \dots n)}, \tag{51.2} $$

where g designates the determinant constructed from the $g_{ij}$.
52. Adjoint tensor of a completely antisymmetric tensor. When $E_n$ is a
Euclidean space there is a particularly interesting completely anti-
symmetric tensor of order n which is directly associated with the
determinant g. For simplicity we assume the space to be properly
Euclidean.†

It is of interest to find a geometrical interpretation of g in the Euc-
lidean vector space of elementary geometry. If x, y, z denote three
arbitrary vectors with components $x^i$, $y^i$, $z^i$ referred to an orthonormal
basis in this space, then the volume V of the parallelopiped, which has
the three vectors x, y, z as edges, is equal to the absolute value of the
determinant

$$ \begin{vmatrix} x^1 & x^2 & x^3 \\ y^1 & y^2 & y^3 \\ z^1 & z^2 & z^3 \end{vmatrix}. $$

Its square

$$ \begin{vmatrix} \sum x^i x^i & \sum x^i y^i & \sum x^i z^i \\ \sum y^i x^i & \sum y^i y^i & \sum y^i z^i \\ \sum z^i x^i & \sum z^i y^i & \sum z^i z^i \end{vmatrix} $$

can be written, in terms of invariants, as

$$ V^2 = \begin{vmatrix} \mathbf{x}^2 & \mathbf{x} \cdot \mathbf{y} & \mathbf{x} \cdot \mathbf{z} \\ \mathbf{x} \cdot \mathbf{y} & \mathbf{y}^2 & \mathbf{y} \cdot \mathbf{z} \\ \mathbf{x} \cdot \mathbf{z} & \mathbf{y} \cdot \mathbf{z} & \mathbf{z}^2 \end{vmatrix}. \tag{52.1} $$

Now let $(\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3)$ be any basis of the space under consideration.
The elements of the determinant in (52.1) are just the $g_{ij}$ which cor-
respond to this basis if $\mathbf{e}_1$, $\mathbf{e}_2$, $\mathbf{e}_3$ are taken to be the vectors x, y, z.
Hence, if V is the volume of the parallelopiped constructed on
$(\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3)$, we have

$$ V^2 = g. $$
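
The relation between the volume and the determinant g can be verified on a concrete basis. This is a minimal sketch; the three basis vectors, given by their components in an orthonormal frame, are arbitrary illustrative values, and the helper names `dot` and `det3` are assumptions.

```python
def dot(a, b):
    """Scalar product of two vectors given in an orthonormal frame."""
    return sum(p * q for p, q in zip(a, b))

def det3(m):
    """Determinant of a 3x3 matrix by expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

# Rows are the orthonormal-frame components of e_1, e_2, e_3 (illustrative).
e = [[1, 1, 0], [0, 2, 1], [1, 0, 3]]

# Coefficients g_ij = e_i . e_j of the fundamental form in this basis.
g = [[dot(e[i], e[j]) for j in range(3)] for i in range(3)]

volume = abs(det3(e))
g_det = det3(g)
print(volume ** 2, g_det)  # 49 49
```

The determinant of the $g_{ij}$ reproduces the squared volume of the parallelopiped built on the basis vectors.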


Having established this result let us return to the proper Euclidean
space $E_n$ and make the change of basis which carries $(\mathbf{e}_i)$ into $(\mathbf{e}_{i'})$.


† This ensures that the determinant g is positive and therefore simplifies
manipulations involving $\sqrt{g}$. (T.)

How does the determinant g transform under this change of basis?
We know that its elements transform according to the tensor rule

$$ g_{k'l'} = A_{k'}^{i} A_{l'}^{j}\, g_{ij}. $$

It then follows from a standard theorem on the multiplication of
determinants that, if g' denotes the determinant formed from the
$g_{k'l'}$,

$$ g' = \Delta^2 g, $$

where $\Delta$ denotes the determinant (50.3) constructed from the $A_{k'}^{i}$.
Taking the square root of both sides it follows that

$$ \sqrt{g'} = |\Delta| \sqrt{g}. \tag{52.2} $$


Let us limit further developments to changes of basis such that the
determinant $\Delta$ is positive. We say that this restricts us to bases which
have the same sense or handedness as $(\mathbf{e}_i)$. Under such changes of
basis, the quantity $1/\sqrt{g}$ transforms like the proper contravariant
component of a completely antisymmetric tensor of order n (see
(50.2)); according to (51.2) this tensor has the proper covariant com-
ponent $\sqrt{g}$. Its ordinary contravariant components may be written

$$ \eta^{i_1 i_2 \dots i_n} = \frac{1}{\sqrt{g}}\, \varepsilon^{i_1 i_2 \dots i_n} \tag{52.3} $$

while its covariant components are given by

$$ \eta_{i_1 i_2 \dots i_n} = \varepsilon_{i_1 i_2 \dots i_n} \sqrt{g} \tag{52.4} $$

where the quantities $\varepsilon_{i_1 i_2 \dots i_n}$ are numerically equal to the $\varepsilon^{i_1 i_2 \dots i_n}$.

Given a completely antisymmetric tensor T of order q (≤ n) with
components $t^{i_1 i_2 \dots i_q}$ or $t_{i_1 i_2 \dots i_q}$, the completely antisymmetric
tensor T' of order (n − q) obtained by contracted multiplication of T
with (52.3) is called the adjoint tensor of T. Its components are given
by

$$ t'^{\,i_{q+1} \dots i_n} = \frac{1}{q!}\, \eta^{i_1 \dots i_q i_{q+1} \dots i_n}\, t_{i_1 \dots i_q}, \tag{52.5} $$

$$ t'_{i_{q+1} \dots i_n} = \frac{1}{q!}\, \eta_{i_1 \dots i_q i_{q+1} \dots i_n}\, t^{i_1 \dots i_q}. \tag{52.6} $$
The completely antisymmetric tensors of orders n and (n − 1) have
adjoint tensors which are respectively a scalar and a vector.

Consider, as an example, the Euclidean space of elementary
geometry, which we assume to be referred to an orthonormal basis so
that contravariant and covariant components are identical. Let $\mathbf{x} \wedge \mathbf{y}$
be a bivector of this space and consider its adjoint z. Denoting the
components of x, y, z by $x^i$, $y^i$, $z^i$, the components of the bivector
$\mathbf{x} \wedge \mathbf{y}$ are given by

$$ p^{23} = x^2 y^3 - x^3 y^2, $$
$$ p^{31} = x^3 y^1 - x^1 y^3, $$
$$ p^{12} = x^1 y^2 - x^2 y^1, $$

and if the determinant g is equal to unity (52.5) gives

$$ z^1 = x^2 y^3 - x^3 y^2, \qquad z^2 = x^3 y^1 - x^1 y^3, \qquad z^3 = x^1 y^2 - x^2 y^1. \tag{52.7} $$

In elementary vector analysis, the vector z is known as the vector
product of the two vectors x and y. The existence of such a vector
product is clearly related to the fact that the space of elementary
geometry is three dimensional.
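
The construction of the adjoint of a bivector can be sketched in code. The example assumes an orthonormal basis of three-dimensional space, so that g = 1 and covariant and contravariant components coincide; the function names and sample vectors are illustrative only. It sums $\tfrac{1}{2!}\, \varepsilon^{ijk}\, t_{ij}$ over all permutations and recovers the components (52.7).

```python
from itertools import permutations

def permutation_signs(n):
    """Map each permutation of (0, ..., n-1) to +1 (even) or -1 (odd)."""
    signs = {}
    for perm in permutations(range(n)):
        inversions = sum(1 for a in range(n) for b in range(a + 1, n)
                         if perm[a] > perm[b])
        signs[perm] = -1 if inversions % 2 else 1
    return signs

def outer(x, y):
    """Components x^i y^j - x^j y^i of the bivector x ^ y."""
    n = len(x)
    return [[x[i] * y[j] - x[j] * y[i] for j in range(n)] for i in range(n)]

def adjoint_of_bivector(t):
    """z^k = (1/2!) eps^{ijk} t_{ij}, assuming an orthonormal basis (g = 1)."""
    z = [0, 0, 0]
    for (i, j, k), sign in permutation_signs(3).items():
        z[k] += sign * t[i][j]
    return [component / 2 for component in z]

x = [1, 2, 3]
y = [4, 5, 6]

z = adjoint_of_bivector(outer(x, y))
print(z)  # [-3.0, 6.0, -3.0]
```

The result agrees with the vector product of x and y computed from the familiar component formula.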


CHAPTER IV 


Curvilinear Coordinates in Euclidean Space 


I. DERIVATIVES AND DIFFERENTIALS 
OF VECTORS AND POINTS 


53. Vector derivative of a vector. Let $E_n$ be an n-dimensional vector
space. If a vector x in $E_n$ corresponds to each value t of a scalar
variable in the interval (a, b), we say that x is a function of t and write
$\mathbf{x} = \mathbf{x}(t)$.

We now introduce a positive definite quadratic form so that $E_n$ is a
proper Euclidean space. The variable vector x is then said to tend to
the zero vector (or more briefly, to tend to zero) if the scalar $|\mathbf{x}|$ tends
to zero. It is clear that this definition is independent of the particular
quadratic form chosen.

We say that the vector $\mathbf{x}(t)$ is a continuous function of t if, when t is
increased by $\Delta t$, the vector

$$ \Delta \mathbf{x} = \mathbf{x}(t + \Delta t) - \mathbf{x}(t) $$

tends to zero as $\Delta t \to 0$. If there exists a vector $\mathbf{x}'$ such that

$$ \frac{\Delta \mathbf{x}}{\Delta t} - \mathbf{x}' $$

tends to zero as $\Delta t \to 0$, $\mathbf{x}'$ is said to be the vector derivative of x for the
value t of the variable. The vector

$$ d\mathbf{x} = \mathbf{x}'\, dt $$

is called the differential of x and we write $\mathbf{x}'$ as $d\mathbf{x}/dt$.

These definitions are simple generalizations of the analogous
definitions of elementary vector calculus. The formulae for the
derivatives of a sum, of a product of a vector and a scalar, and of a
vector function of a scalar function are formally identical with the well-
known formulae of elementary vector calculus and are demonstrated
in exactly the same way. This also holds for the derivative of the scalar
product.


54. Vector derivative of a point. Let $\mathcal{E}_n$ be an n-dimensional affine
point space. Consider a scalar t which varies over the interval (a, b).
If to each value of t there corresponds a point M of $\mathcal{E}_n$, M is said to be
a function of t and we write M = M(t).

If O is an arbitrary fixed point of $\mathcal{E}_n$ the vector $\mathbf{x} = \overrightarrow{OM}$ is a function
of t. Suppose that this vector has the derivative $\mathbf{x}'$. It is obvious that
the vector $\mathbf{x}'$ does not depend on the choice of the fixed point O, but
only on M. In fact, if O' denotes another arbitrary fixed point,

$$ \overrightarrow{OM} = \overrightarrow{OO'} + \overrightarrow{O'M} $$

and, as $\overrightarrow{OO'}$ is fixed,

$$ \frac{d}{dt}\left(\overrightarrow{OM}\right) = \frac{d}{dt}\left(\overrightarrow{O'M}\right) = \mathbf{x}'. $$

The vector $\mathbf{x}'$ is termed the vector derivative of the point M and it is
represented by M'. The vector

$$ dM = M'\, dt $$

is called the differential of M and we write M' = dM/dt.


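55. Vector functions of several scalar variables. As in elementary
vector analysis, a vector x can be a function of a number of inde-
pendent scalar variables $\alpha$, $\beta$, $\gamma$. The concept of a partial derivative
can immediately be extended to such a vector function. As in ordinary
analysis the relation

$$ \frac{\partial^2 \mathbf{x}}{\partial \alpha\, \partial \beta} = \frac{\partial^2 \mathbf{x}}{\partial \beta\, \partial \alpha} $$

may be established provided that the partial derivatives are contin-
uous. The differential of such a vector function is given by

$$ d\mathbf{x} = \frac{\partial \mathbf{x}}{\partial \alpha}\, d\alpha + \frac{\partial \mathbf{x}}{\partial \beta}\, d\beta + \frac{\partial \mathbf{x}}{\partial \gamma}\, d\gamma. $$

If x is a function of t through several scalar variables $\alpha(t)$, $\beta(t)$, $\gamma(t)$
its derivative is obtained in the same way as in ordinary analysis:

$$ \frac{d\mathbf{x}}{dt} = \frac{\partial \mathbf{x}}{\partial \alpha} \frac{d\alpha}{dt} + \frac{\partial \mathbf{x}}{\partial \beta} \frac{d\beta}{dt} + \frac{\partial \mathbf{x}}{\partial \gamma} \frac{d\gamma}{dt}. $$

The above remarks concerning the vector derivatives of a vector
apply equally well to the vector derivatives of a point.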


II. CURVILINEAR COORDINATES IN A
EUCLIDEAN POINT SPACE


56. Curvilinear coordinates. Associated reference frames. We shall
restrict the remainder of this chapter to n-dimensional Euclidean
point spaces $\mathcal{E}_n$ and we propose to study several geometrical ideas.
Let $\mathcal{E}_n$ be referred at some moment to an arbitrary frame and denote
the coordinates of any point M with respect to this frame by $(x^i)$.
We have already seen that there corresponds one, and only one, point
M of $\mathcal{E}_n$ to each system of n numbers $(x^1, x^2, \dots, x^n)$, and conversely. In
order to distinguish these coordinates from those which are intro-
duced below we shall call them rectilinear coordinates.†

Consider n continuously differentiable functions $f^i(y^1, y^2, \dots, y^n)$
of n variables $y^j$ and put

$$ x^i = f^i(y^1, y^2, \dots, y^n) \qquad (i = 1, 2, \dots, n). \tag{56.1} $$

These n functions are assumed to be independent so that, when the
variables $(y^j)$ vary over a domain $\mathcal{D}'$, the system of n equations (56.1)
can be solved for the $y^j$, giving

$$ y^j = \varphi^j(x^1, x^2, \dots, x^n) \qquad (j = 1, 2, \dots, n). \tag{56.2} $$


† The essential distinction between a rectilinear coordinate system and
a curvilinear one is that in the latter the reference frame $(\mathbf{e}_i)$ is a function
of position while in the former it is constant. (T.)
Here the point M, whose coordinates are $(x^i)$, varies over a certain
domain $\mathcal{D}$ of $\mathcal{E}_n$. Since the n functions (56.1) are assumed to be
independent, the functional determinant or Jacobian

$$ \frac{D(x^1, x^2, \dots, x^n)}{D(y^1, y^2, \dots, y^n)} = \begin{vmatrix} \dfrac{\partial f^1}{\partial y^1} & \dots & \dfrac{\partial f^1}{\partial y^n} \\ \vdots & & \vdots \\ \dfrac{\partial f^n}{\partial y^1} & \dots & \dfrac{\partial f^n}{\partial y^n} \end{vmatrix} \tag{56.3} $$

is different from zero in $\mathcal{D}'$. This is also true of the functional deter-
minant

$$ \frac{D(y^1, y^2, \dots, y^n)}{D(x^1, x^2, \dots, x^n)} $$

which is the inverse of (56.3).

This demonstrates the existence of a one-to-one correspondence
between the points M of $\mathcal{D}$ and the variables $(y^j)$ which range over $\mathcal{D}'$.
If the functions $f^i$ are non-linear the $y^j$ cannot be interpreted as a
system of rectilinear coordinates. The point M is a function, many
times continuously differentiable, of the n scalar variables $(y^j)$. We
say that the space $\mathcal{E}_n$ has been referred, in the domain $\mathcal{D}$, to a system
of curvilinear coordinates $(y^j)$. The curves traced out by a point M for
which only one of the $(y^j)$ varies are called coordinate lines; n of these
intersect at any point M of $\mathcal{D}$. In rectilinear coordinate systems the
coordinate lines are straight, which accounts for the name given to
these coordinates.

Given a curvilinear coordinate system $(y^i)$ and a point M of $\mathcal{E}_n$ we
associate with M a reference frame called the natural frame at M of the
system $(y^i)$. This reference frame has its origin at the point M and its
vectors are given by

$$ \mathbf{e}_i = \frac{\partial M}{\partial y^i} \qquad (i = 1, 2, \dots, n). \tag{56.4} $$

It is obvious that the system of n vectors $\mathbf{e}_i$ is linearly independent
since the determinant of the components of these vectors referred to
the initial reference frame is the determinant (56.3) which is non-
zero. The n vectors $\mathbf{e}_i$ are manifestly collinear with the tangents to the n
coordinate lines which intersect at M. From (56.4) the vector dif-
ferential of M is given by

$$ dM = \mathbf{e}_i\, dy^i. \tag{56.5} $$

In other words the n quantities $dy^i$ are the contravariant components
of the vector dM in the natural reference frame of the system $(y^i)$ at M.
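
Relations (56.4) and (56.5) can be illustrated with plane polar coordinates. This is only a sketch under stated assumptions: the plane case, the sample point and the small increments are chosen for illustration, and the function names are hypothetical. It builds the natural frame from the partial derivatives of M, evaluates the Jacobian, and compares $\mathbf{e}_i\, dy^i$ with the actual displacement of M.

```python
import math

def point(r, phi):
    """Rectilinear coordinates of the point with polar coordinates (r, phi)."""
    return (r * math.cos(phi), r * math.sin(phi))

def natural_frame(r, phi):
    """Vectors e_1 = dM/dr and e_2 = dM/dphi in rectilinear components."""
    e_r = (math.cos(phi), math.sin(phi))
    e_phi = (-r * math.sin(phi), r * math.cos(phi))
    return e_r, e_phi

def jacobian(r, phi):
    """Determinant of the components of the natural frame, as in (56.3)."""
    e_r, e_phi = natural_frame(r, phi)
    return e_r[0] * e_phi[1] - e_r[1] * e_phi[0]

r, phi = 2.0, 0.7
dr, dphi = 1e-6, -2e-6

e_r, e_phi = natural_frame(r, phi)
# dM = e_i dy^i, as in (56.5).
linear = tuple(e_r[k] * dr + e_phi[k] * dphi for k in range(2))

m0 = point(r, phi)
m1 = point(r + dr, phi + dphi)
exact = tuple(m1[k] - m0[k] for k in range(2))

error = max(abs(linear[k] - exact[k]) for k in range(2))
print(jacobian(r, phi), error)
```

The Jacobian equals r, so the frame degenerates only at the origin, and the discrepancy between the two displacements is of second order in the increments.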
We speak of transforming the curvilinear coordinates if we sub- 

stitute a new system of variables (y/’) for the (y’). We then have 


y = vy (yy, «2s γ' wane FO I" seve κὐ (56.6) 


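where the $y^i$ are many times continuously differentiable functions of
the $y^{j'}$, and conversely. When such a transformation of curvilinear
coordinates is made the natural reference system $(M, \mathbf{e}_i)$ of the
system $(y^i)$ is replaced by the natural reference system $(M, \mathbf{e}_{j'})$ of the
system $(y^{j'})$. It is easy to deduce the transformation rules which link
one reference system to the other. From the formula for the derivative
of a function of a function we have, in fact,

$$\frac{\partial M}{\partial y^{j'}} = \frac{\partial M}{\partial y^i}\,\frac{\partial y^i}{\partial y^{j'}}, \qquad
\frac{\partial M}{\partial y^i} = \frac{\partial M}{\partial y^{j'}}\,\frac{\partial y^{j'}}{\partial y^i}.$$

It follows that

$$\mathbf{e}_{j'} = \frac{\partial y^i}{\partial y^{j'}}\,\mathbf{e}_i$$

and conversely

$$\mathbf{e}_i = \frac{\partial y^{j'}}{\partial y^i}\,\mathbf{e}_{j'}.$$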


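The basis $(\mathbf{e}_i)$ of the Euclidean vector space associated with $\mathcal{E}_n$ has
thus been replaced by the basis $(\mathbf{e}_{j'})$ given by the linear transformation

$$\mathbf{e}_i = A_i^{j'}\,\mathbf{e}_{j'}, \qquad \mathbf{e}_{j'} = A_{j'}^{i}\,\mathbf{e}_i, \tag{56.7}$$

where

$$A_{j'}^{i} = \frac{\partial y^i}{\partial y^{j'}}, \qquad A_i^{j'} = \frac{\partial y^{j'}}{\partial y^i}. \tag{56.8}$$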


This may be stated formally as follows: 

THEOREM: Each change of curvilinear coordinates is associated with
the change of natural reference frame at $M$ defined by the relations
(56.7) and (56.8).


57. Example of a curvilinear coordinate system. Consider the Euclidean
space, $\mathcal{E}_3$, of elementary geometry and let $Oxyz$ be a Cartesian
reference frame. The cylindrical coordinate system and the polar
coordinate system in this space are examples of curvilinear coordinates.
The latter system, for example, is defined by

$$x = r\sin\theta\cos\varphi, \qquad y = r\sin\theta\sin\varphi, \qquad z = r\cos\theta,$$

and, conversely,

$$r = \sqrt{x^2 + y^2 + z^2}, \qquad \varphi = \tan^{-1}\frac{y}{x}, \qquad \theta = \tan^{-1}\frac{\sqrt{x^2 + y^2}}{z}.$$

Let us write

$$y^1 = r, \qquad y^2 = \varphi, \qquad y^3 = \theta.$$

The coordinate lines at $M$ are respectively the radius vector $\overrightarrow{OM}$, the
circle of radius $r\sin\theta$ through $M$ with its centre on and plane perpendicular
to the axis $Oz$, and the meridian circle, centre $O$, radius $r$, which
passes through $M$. The natural reference frame at $M$ consists of $\mathbf{e}_1$,
the unit vector in the direction $\overrightarrow{OM}$, $\mathbf{e}_2$, tangential to the circle about
$Oz$ and of length $r\sin\theta$, and $\mathbf{e}_3$, the vector tangent to the meridian and
of length $r$. We note that these three vectors are always orthogonal.

In general, when the $n$ vectors $\mathbf{e}_i$ of the natural reference system are
orthogonal, the curvilinear coordinate system is itself said to be
orthogonal.
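
The description of the natural frame given above can be checked numerically. The following is a minimal sketch added for illustration, not part of the original text: it approximates the vectors $\mathbf{e}_i = \partial M/\partial y^i$ of (56.4) by central differences for the polar coordinates of this section, taking $y^1 = r$, $y^2$ as the angle about $Oz$ and $y^3 = \theta$. The function names `point` and `natural_frame`, the step size and the sample point are assumptions made for the example.

```python
import math

def point(y):
    """Cartesian position of M for y = (r, phi, theta), theta measured from Oz."""
    r, phi, theta = y
    return (r * math.sin(theta) * math.cos(phi),
            r * math.sin(theta) * math.sin(phi),
            r * math.cos(theta))

def natural_frame(y, h=1e-6):
    """Approximate e_i = dM/dy^i by central differences (hypothetical helper)."""
    frame = []
    for i in range(3):
        up = list(y)
        dn = list(y)
        up[i] += h
        dn[i] -= h
        p, q = point(up), point(dn)
        frame.append(tuple((a - b) / (2 * h) for a, b in zip(p, q)))
    return frame

def dot(u, v):
    """Ordinary scalar product in the Cartesian frame Oxyz."""
    return sum(a * b for a, b in zip(u, v))

# Sample point chosen arbitrarily: r = 2, phi = 0.7, theta = 1.1.
y = (2.0, 0.7, 1.1)
e = natural_frame(y)
# Matrix of scalar products e_i . e_j at the sample point.
g = [[dot(e[i], e[j]) for j in range(3)] for i in range(3)]
```

At the sample point the matrix of scalar products should be diagonal to within the finite-difference error, with diagonal entries $1$, $r^2\sin^2\theta$ and $r^2$, in agreement with the lengths quoted above.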


58. The line element. If the Euclidean point space $\mathcal{E}_n$ is referred to a
system of curvilinear coordinates $(y^i)$, the vector $d\mathbf{M}$ has components
$dy^i$ with respect to the natural reference frame at $M$. The square of
this vector, or the square of the distance between two neighbouring
points, is therefore given by

$$ds^2 = g_{ij}\,dy^i\,dy^j, \tag{58.1}$$

where $g_{ij} = \mathbf{e}_i \cdot \mathbf{e}_j$.
As the point $M$ varies, the natural reference frame also varies. The
scalar products $g_{ij}$ of two vectors of the reference frame are therefore
functions of the curvilinear coordinates of the point $M$. This is the
case, for instance, in the example of the previous section, where

$$ds^2 = dr^2 + r^2\sin^2\theta\,d\varphi^2 + r^2\,d\theta^2. \tag{58.2}$$

The expression (58.1) is called the line element or metric. The
length of arc of any curve in $\mathcal{E}_n$ can be calculated from a knowledge of
the metric. If the curve $AB$ is defined by giving the $y^i$ as functions of the
parameter $t$, which varies over the interval $(a, b)$, its length is given by
the integral

$$\widehat{AB} = \int_a^b \sqrt{g_{ij}\,\frac{dy^i}{dt}\,\frac{dy^j}{dt}}\;dt. \tag{58.3}$$


In the same way, the element of volume in this space is the volume of a
parallelepiped constructed from the vectors $\mathbf{e}_1\,dy^1, \mathbf{e}_2\,dy^2, \ldots, \mathbf{e}_n\,dy^n$;
i.e.

$$dV = \sqrt{|g|}\;dy^1\,dy^2 \cdots dy^n, \tag{58.4}$$

where $g$ denotes the determinant of the $g_{ij}$. Finite volumes can be
determined from this by integration.
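
As an illustration of (58.4), consider the polar coordinates of §57, whose natural frame is orthogonal with vectors of lengths $1$, $r\sin\theta$ and $r$. The worked computation below is a sketch added for illustration; the radius $R$ of the ball is a symbol introduced here and does not appear in the text.

```latex
% Orthogonal frame: g_{ij} is diagonal with entries |e_i|^2.
g = \det(g_{ij}) = 1 \cdot r^2\sin^2\theta \cdot r^2 = r^4\sin^2\theta,
\qquad \sqrt{|g|} = r^2\sin\theta \quad (0 \le \theta \le \pi).

% Volume of the ball r \le R, the angle about Oz running over (0, 2\pi):
V = \int_0^R r^2\,dr \int_0^{\pi} \sin\theta\,d\theta \int_0^{2\pi} d\varphi
  = \frac{R^3}{3} \cdot 2 \cdot 2\pi = \frac{4}{3}\pi R^3 .
```

The result agrees with the elementary formula for the volume of a sphere, as it must, since the space is Euclidean and only the coordinates are curvilinear.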


59. Tensor fields. A Euclidean vector space $E_n$ is associated with the
Euclidean point space $\mathcal{E}_n$. Each reference frame of $\mathcal{E}_n$ defines a basis
in $E_n$ and, consequently, bases for the various tensor powers of $E_n$.
For brevity we shall say that the components of a Euclidean tensor
with respect to such a basis are the components referred to the
corresponding frame of $\mathcal{E}_n$.

Suppose that we associate with each point $M$ of $\mathcal{E}_n$ a Euclidean
tensor defined by its components referred to the natural frame at $M$
of the system $(y^i)$. We call this a tensor field in the system of curvilinear
coordinates $(y^i)$. The remainder of this chapter is devoted to tensor
analysis, which is essentially the study of tensor fields.

We have seen that, to each change of curvilinear coordinates there
corresponds a change of the natural reference frame at $M$ defined by
(56.7), (56.8). If $\mathbf{T}$, for example, denotes a tensor of order three with
the mixed components $t^i{}_{jk}$, these components transform under a
change of curvilinear coordinates according to the tensor relations

$$t^{i'}{}_{j'k'} = A_{l}^{i'}\,A_{j'}^{m}\,A_{k'}^{r}\;t^{l}{}_{mr},$$

where the $A$'s are defined by (56.8).

The quantities $g_{ij}$ are the covariant components of a tensor referred
to the natural frame at $M$. The fundamental tensor is therefore an
example of a tensor field and its components $g_{ij}$ transform according
to

$$g_{i'j'} = A_{i'}^{k}\,A_{j'}^{l}\,g_{kl}.$$


III. THE CHRISTOFFEL SYMBOLS


60. The fundamental problem of tensor analysis. Let the Euclidean
space $\mathcal{E}_n$ be referred to a curvilinear coordinate system $(y^i)$ and let its
metric be

$$ds^2 = g_{ij}\,dy^i\,dy^j. \tag{60.1}$$


In the study of tensor fields referred to curvilinear coordinate systems
the following difficulty is encountered: tensors associated with different
points are referred to different reference frames and, in order to
compare tensors at neighbouring points with one another, it is necessary
to consider the variation of the reference frame between those
points.

Since

$$g_{ij} = \mathbf{e}_i \cdot \mathbf{e}_j,$$

if the metric (60.1) is a known function of position then we know the
form of the natural reference frame $(\mathbf{e}_i)$ for the different points of $\mathcal{E}_n$.
This leads us to pose the following problem:

Given the Euclidean space $\mathcal{E}_n$ referred to a curvilinear coordinate
system $(y^i)$ and with the metric

$$ds^2 = g_{ij}\,dy^i\,dy^j,$$

determine the natural reference frame $(M + d\mathbf{M},\ \mathbf{e}_i + d\mathbf{e}_i)$† at the point
$M + d\mathbf{M}$ in the immediate neighbourhood of $M$ in terms of $(M, \mathbf{e}_i)$.

The natural frame $(M + d\mathbf{M},\ \mathbf{e}_i + d\mathbf{e}_i)$ will be completely determined
relative to $(M, \mathbf{e}_i)$ if we know the contravariant components of the
vectors $d\mathbf{M}$ and $d\mathbf{e}_i$ referred to the basis $(\mathbf{e}_i)$. The contravariant
components, $dy^i$, of $d\mathbf{M}$ are given by

$$d\mathbf{M} = dy^i\,\mathbf{e}_i. \tag{60.2}$$

† It is convenient to use the purely symbolic notation $M + d\mathbf{M}$ to
represent a point displaced by $d\mathbf{M}$ from the point $M$. (T.)

We can also write

$$d\mathbf{e}_i = \omega_i{}^j\,\mathbf{e}_j, \tag{60.3}$$

where the $\omega_i{}^j$ denote the contravariant components† of the $d\mathbf{e}_i$. These
components are obviously linearly dependent on the $dy^k$. We therefore
have

$$\omega_i{}^j = \Gamma_k{}^j{}_i\,dy^k, \tag{60.4}$$

where the $\Gamma_k{}^j{}_i$ denote $n^3$ functions of the variables $(y^i)$. Our problem
is therefore equivalent to the determination of the $n^3$ functions $\Gamma_k{}^j{}_i$ from
the $n(n+1)/2$ functions $g_{ij}$.


61. Relations between the $\Gamma_k{}^j{}_i$. (1) Given the line element (60.1), the
natural reference frame at every point $M$ of $\mathcal{E}_n$ satisfies

$$\mathbf{e}_i \cdot \mathbf{e}_j = g_{ij}. \tag{61.1}$$

It follows that

$$\mathbf{e}_i \cdot d\mathbf{e}_j + \mathbf{e}_j \cdot d\mathbf{e}_i = dg_{ij}.$$

Substituting for the $d\mathbf{e}_i$ from (60.3) and using (61.1) we obtain

$$g_{ih}\,\omega_j{}^h + g_{jh}\,\omega_i{}^h = dg_{ij}. \tag{61.2}$$

The form of (61.2) leads us to introduce the covariant components†
$\omega_{ij}$ of $d\mathbf{e}_i$ in addition to the quantities $\omega_i{}^j$, and the coefficients $\Gamma_{kji}$ of
$dy^k$ in the expression for $\omega_{ij}$. We then have

$$\omega_{ij} = \Gamma_{kji}\,dy^k,$$

where

$$\omega_{ij} = g_{jh}\,\omega_i{}^h; \qquad \Gamma_{kji} = g_{jh}\,\Gamma_k{}^h{}_i; \qquad \Gamma_k{}^j{}_i = g^{jh}\,\Gamma_{khi}. \tag{61.3}$$

A knowledge of the $n^3$ functions $\Gamma_{kji}$ is thus equivalent to knowing
the $n^3$ functions $\Gamma_k{}^j{}_i$. Using this notation (61.2) can be rewritten as

$$\omega_{ji} + \omega_{ij} = dg_{ij}. \tag{61.4}$$

† This terminology should not be taken to imply that the $\omega_i{}^j$ are the
components of a tensor (see §63). (T.)

Identifying the coefficients of $dy^k$ on both sides of (61.4) we obtain an
equivalent system of equations relating finite quantities:

$$\Gamma_{kij} + \Gamma_{kji} = \partial_k g_{ij}, \tag{61.5}$$

where we have written

$$\partial_k = \frac{\partial}{\partial y^k}$$

in order to emphasize the covariant nature of the suffix $k$.

The system (61.5) comprises as many equations as there are distinct
quantities $\partial_k g_{ij}$. Since there are $n(n+1)/2$ quantities $g_{ij}$, (61.5) contains
$n^2(n+1)/2$ equations.

(2) The equations (60.2) and (60.3) give the differentials of the
point $M$ and of the vectors $\mathbf{e}_i$ respectively and are therefore integrable;
they thus give second derivatives of $M$ and of the $\mathbf{e}_i$ which are symmetrical
with respect to the indices of differentiation. Let us determine
these integrability conditions for equations (60.2); we have

$$\frac{\partial^2 M}{\partial y^k\,\partial y^j} = \frac{\partial}{\partial y^k}\left(\frac{\partial M}{\partial y^j}\right) = \frac{\partial \mathbf{e}_j}{\partial y^k}$$

and substituting (60.3),

$$\frac{\partial^2 M}{\partial y^k\,\partial y^j} = \Gamma_k{}^i{}_j\,\mathbf{e}_i. \tag{61.6}$$

Similarly

$$\frac{\partial^2 M}{\partial y^j\,\partial y^k} = \Gamma_j{}^i{}_k\,\mathbf{e}_i. \tag{61.7}$$

Since the left-hand sides of (61.6) and (61.7) are equal we deduce
that

$$\Gamma_k{}^i{}_j\,\mathbf{e}_i = \Gamma_j{}^i{}_k\,\mathbf{e}_i,$$

and hence

$$\Gamma_k{}^i{}_j = \Gamma_j{}^i{}_k, \tag{61.8}$$

which may, with the aid of (61.3), be written as

$$\Gamma_{kij} = \Gamma_{jik}. \tag{61.9}$$

The quantities $\Gamma_k{}^i{}_j$ are therefore symmetric with respect to their
lower indices and the quantities $\Gamma_{kij}$ with respect to their first and last
indices. For each value of $i$ the integrability conditions (61.9) give
$n(n-1)/2$ separate equations. The system (61.9) thus comprises
a total of $n^2(n-1)/2$ equations. These equations added to the
$n^2(n+1)/2$ equations (61.5) give a system of $n^3$ linear equations in the
$n^3$ unknowns $\Gamma_{kji}$.

The conditions for the integrability of the $d\mathbf{e}_i$ could be determined
in a similar manner. We shall show, however, that the $n^3$ relations
already obtained allow the calculation of the $\Gamma_{kji}$ from the $g_{ij}$ and their
derivatives. It follows that the integrability conditions for the $d\mathbf{e}_i$ are
necessary criteria for the original problem to be soluble. This is
associated with the fact that given an arbitrary quadratic form with
variable coefficients there does not always exist a curvilinear coordinate
system in the Euclidean space $\mathcal{E}_n$ which is consistent with the
interpretation of this form as the $ds^2$ of that space. We shall return to this
point later.


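62. Explicit determination of the $\Gamma_k{}^j{}_i$. It is a simple matter to obtain
an explicit solution of the $n^3$ linear equations (61.5) and (61.9).
Allowing for (61.9), equations (61.5) may be written

$$\Gamma_{kji} + \Gamma_{jik} = \partial_k g_{ij}. \tag{62.1}$$

Cyclic permutation of the indices gives

$$\Gamma_{ikj} + \Gamma_{kji} = \partial_i g_{jk}, \tag{62.2}$$

$$\Gamma_{jik} + \Gamma_{ikj} = \partial_j g_{ki}. \tag{62.3}$$

Adding (62.1) and (62.2) and subtracting (62.3) we obtain

$$2\Gamma_{kji} = \partial_k g_{ij} + \partial_i g_{jk} - \partial_j g_{ki}.$$

Using the notation

$$[ki, j] = \tfrac{1}{2}\left(\partial_k g_{ij} + \partial_i g_{jk} - \partial_j g_{ki}\right), \tag{62.4}$$

this becomes

$$\Gamma_{kji} = [ki, j]. \tag{62.5}$$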
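
Formulae (62.4) and (62.5) lend themselves to a direct numerical check. The sketch below is an illustration rather than part of the text: it takes the plane polar metric $ds^2 = dr^2 + r^2\,d\theta^2$ as an assumed example, estimates the $\partial_k g_{ij}$ by central differences, and builds the brackets $[ki, j]$ together with the quantities $g^{jh}[ki, h]$ which (61.3) associates with them. All function names and the sample point are hypothetical.

```python
def metric(y):
    """g_ij for plane polar coordinates y = (r, theta): ds^2 = dr^2 + r^2 dtheta^2."""
    r, _ = y
    return [[1.0, 0.0], [0.0, r * r]]

def d_metric(y, k, step=1e-5):
    """Central-difference estimate of d_k g_ij, returned as a 2 x 2 list."""
    up = list(y)
    dn = list(y)
    up[k] += step
    dn[k] -= step
    gu, gd = metric(up), metric(dn)
    return [[(gu[i][j] - gd[i][j]) / (2 * step) for j in range(2)] for i in range(2)]

def inverse2(g):
    """Inverse of a 2 x 2 matrix, giving the contravariant components g^ij."""
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    return [[g[1][1] / det, -g[0][1] / det], [-g[1][0] / det, g[0][0] / det]]

def christoffel(y):
    """Return (first, second): first[k][j][i] = [ki, j], second[k][j][i] = g^{jh} [ki, h]."""
    n = 2
    dg = [d_metric(y, k) for k in range(n)]   # dg[k][i][j] = d_k g_ij
    first = [[[0.5 * (dg[k][i][j] + dg[i][j][k] - dg[j][k][i])
               for i in range(n)] for j in range(n)] for k in range(n)]
    ginv = inverse2(metric(y))
    second = [[[sum(ginv[j][h] * first[k][h][i] for h in range(n))
                for i in range(n)] for j in range(n)] for k in range(n)]
    return first, second

# Sample point r = 2, theta = 0.3 (index 0 stands for r, index 1 for theta).
first, second = christoffel((2.0, 0.3))
```

For this metric the only non-zero brackets are $[\theta\theta, r] = -r$ and $[r\theta, \theta] = [\theta r, \theta] = r$, so that at $r = 2$ the entry `first[1][0][1]` should be close to $-2$ and `second[0][1][1]` close to $1/r = 0.5$.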
It is clear that the values of $\Gamma_{kji}$ given by (62.5) satisfy (61.5) and
(61.9). Using (61.3) we obtain

$$\Gamma_k{}^j{}_i = g^{jh}\,\Gamma_{khi} = g^{jh}\,[ki, h].$$

This suggests another bracket symbol defined by

$$\left\{{}_{k}{}^{j}{}_{i}\right\} = g^{jh}\,[ki, h], \tag{62.6}$$

so that

$$\Gamma_k{}^j{}_i = \left\{{}_{k}{}^{j}{}_{i}\right\}. \tag{62.7}$$

The symbols defined by (62.4) and (62.6) are called Christoffel
symbols of the first and second kind respectively. They provide a means
of calculating the $\Gamma_{kji}$ and $\Gamma_k{}^j{}_i$ from the $g_{ij}$ and their derivatives. This
completes the solution of our fundamental problem.


63. Transformation of the $\Gamma_k{}^j{}_i$. It is important to note that neither the
$\omega_i{}^j$ nor the $\Gamma_k{}^j{}_i$ are the components of a tensor. Let us determine the
way in which these quantities transform under a change of curvilinear
coordinates. The vectors of the natural reference frame transform
according to the relations

$$\mathbf{e}_{i'} = A_{i'}^{l}\,\mathbf{e}_l,$$

and differentiating we get

$$d\mathbf{e}_{i'} = A_{i'}^{l}\,d\mathbf{e}_l + dA_{i'}^{l}\,\mathbf{e}_l.$$

However, from (60.3),

$$d\mathbf{e}_l = \omega_l{}^m\,\mathbf{e}_m, \qquad d\mathbf{e}_{i'} = \omega_{i'}{}^{j'}\,\mathbf{e}_{j'},$$

giving

$$\omega_{i'}{}^{j'}\,\mathbf{e}_{j'} = A_{i'}^{l}\,\omega_l{}^m\,\mathbf{e}_m + dA_{i'}^{l}\,\mathbf{e}_l
= \left(A_{i'}^{l}\,A_{m}^{j'}\,\omega_l{}^m + A_{l}^{j'}\,dA_{i'}^{l}\right)\mathbf{e}_{j'}.$$

Equating the coefficients of $\mathbf{e}_{j'}$ we see that the $\omega_i{}^j$ transform according
to the formulae

$$\omega_{i'}{}^{j'} = A_{i'}^{l}\,A_{m}^{j'}\,\omega_l{}^m + A_{l}^{j'}\,dA_{i'}^{l}. \tag{63.1}$$

Expanding both sides of (63.1) in terms of the quantities $\Gamma_k{}^j{}_i$ we find
that

$$\Gamma_{k'}{}^{j'}{}_{i'}\,dy^{k'} = A_{i'}^{l}\,A_{m}^{j'}\,\Gamma_r{}^m{}_l\,dy^r + A_{l}^{j'}\,\partial_{k'}A_{i'}^{l}\,dy^{k'}.$$

Since $dy^r = A_{k'}^{r}\,dy^{k'}$ and the $dy^{k'}$ are arbitrary displacements we can
equate their coefficients on both sides of this equation. Hence the
$\Gamma_k{}^j{}_i$ transform according to the formula

$$\Gamma_{k'}{}^{j'}{}_{i'} = A_{k'}^{r}\,A_{i'}^{l}\,A_{m}^{j'}\,\Gamma_r{}^m{}_l + A_{l}^{j'}\,\partial_{k'}A_{i'}^{l}, \tag{63.2}$$

which involves the second order derivatives of the functions defining
the transformation.


IV. ABSOLUTE DIFFERENTIALS AND
COVARIANT DERIVATIVES


64. Absolute differential of a vector. (1) Let us consider a field of
vectors $\mathbf{v}$ in $\mathcal{E}_n$ defined by their contravariant components $v^i$ and try to
determine the contravariant components of the differential $d\mathbf{v}$ of a
vector of the field for an infinitesimal variation of the point $M$. On
passing from $M$ to $M + d\mathbf{M}$ not only do the components $v^i$ change, but
the natural frame is also modified in the way we have just seen. Since
the vector $\mathbf{v}$ is defined at all points of $\mathcal{E}_n$ by the relation

$$\mathbf{v} = v^i\,\mathbf{e}_i$$

we get by differentiation

$$d\mathbf{v} = dv^i\,\mathbf{e}_i + v^i\,d\mathbf{e}_i.$$

Substituting for $d\mathbf{e}_i$ from (60.3), and relabelling the indices,

$$d\mathbf{v} = dv^i\,\mathbf{e}_i + v^h\,\omega_h{}^i\,\mathbf{e}_i.$$

It follows that the contravariant components of the vector $d\mathbf{v}$ are

$$\nabla v^i = dv^i + \omega_h{}^i\,v^h. \tag{64.1}$$

According to their definition the quantities $\nabla v^i$ transform like the
contravariant components of a vector, although this is evidently not
true of the quantities $dv^i$. For this reason $\nabla v^i$ is called the absolute
differential of $v^i$. $d\mathbf{v}$ is often loosely termed the absolute differential
of $\mathbf{v}$.




Partial derivatives may be introduced instead of differentials; $\nabla v^i$
can be expressed as a linear differential form with respect to $dy^k$ as
follows:

$$\nabla v^i = \partial_k v^i\,dy^k + \Gamma_k{}^i{}_h\,v^h\,dy^k = \left(\partial_k v^i + \Gamma_k{}^i{}_h\,v^h\right)dy^k.$$

Now the $dy^k$ (which take arbitrary values) are the contravariant
components of a vector. Hence the quantities

$$\nabla_k v^i = \partial_k v^i + \Gamma_k{}^i{}_h\,v^h \tag{64.2}$$

are the components of a tensor, the index $k$ being covariant. We call
this tensor the covariant derivative of the vector $\mathbf{v}$. It should be noted
that, like the $dv^i$, the $\partial_k v^i$ do not form a tensor.

If the absolute differential or the covariant derivative of the vector
$\mathbf{v}$ is identically zero, all the vectors of the field are identical; in other
words the field is uniform.
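
The statement about uniform fields can be illustrated in the Euclidean plane referred to polar coordinates $y^1 = r$, $y^2 = \theta$. The worked check below is a sketch added for illustration; the values of the $\Gamma$ quoted in it are those which (62.4) and (62.6) give for the metric $ds^2 = dr^2 + r^2\,d\theta^2$, and the field chosen is the constant unit vector along $Ox$.

```latex
% Non-zero Christoffel symbols of ds^2 = dr^2 + r^2 d\theta^2:
\Gamma_\theta{}^r{}_\theta = -r, \qquad
\Gamma_r{}^\theta{}_\theta = \Gamma_\theta{}^\theta{}_r = \frac{1}{r}.

% Components of the constant unit vector along Ox in the natural frame:
v^r = \cos\theta, \qquad v^\theta = -\frac{\sin\theta}{r}.

% Covariant derivative (64.2), component by component:
\nabla_r v^r = \partial_r v^r = 0,
\nabla_\theta v^r = -\sin\theta + \Gamma_\theta{}^r{}_\theta\,v^\theta
                  = -\sin\theta + (-r)\left(-\frac{\sin\theta}{r}\right) = 0,
\nabla_r v^\theta = \frac{\sin\theta}{r^2} + \Gamma_r{}^\theta{}_\theta\,v^\theta
                  = \frac{\sin\theta}{r^2} - \frac{\sin\theta}{r^2} = 0,
\nabla_\theta v^\theta = -\frac{\cos\theta}{r} + \Gamma_\theta{}^\theta{}_r\,v^r
                       = -\frac{\cos\theta}{r} + \frac{\cos\theta}{r} = 0.
```

The components $v^i$ vary from point to point, yet every covariant derivative vanishes: the variation of the components is exactly compensated by the variation of the natural frame.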

(2) Suppose now that the vectors $\mathbf{v}$ of the field are defined in terms
of their covariant components $v_i$. We propose to determine $d\mathbf{v}$ in
terms of its covariant components $\nabla v_i$. To this end we introduce an
arbitrary uniform field $\mathbf{w}$ with components $w^i$ and form the scalar
product

$$\mathbf{w} \cdot \mathbf{v} = w^i\,v_i.$$

Since $d\mathbf{w}$ is zero, we have

$$\mathbf{w} \cdot d\mathbf{v} = w^i\,dv_i + v_i\,dw^i$$

and

$$\nabla w^i = dw^i + \omega_h{}^i\,w^h = 0,$$

so we obtain

$$\mathbf{w} \cdot d\mathbf{v} = w^i\,dv_i - \omega_h{}^i\,v_i\,w^h,$$

or

$$w^i\,\nabla v_i = w^i\left(dv_i - \omega_i{}^h\,v_h\right).$$

This equality holds for any values of the $w^i$, so we have

$$\nabla v_i = dv_i - \omega_i{}^h\,v_h. \tag{64.3}$$

$\nabla v_i$ is called the absolute differential of $v_i$. Similar reasoning to
that above enables us to pass from differentials to derivatives and it
follows that

$$\nabla_k v_i = \partial_k v_i - \Gamma_k{}^h{}_i\,v_h, \tag{64.4}$$

where the $\nabla_k v_i$ are the covariant components of the covariant tensor
derivative of the vector $\mathbf{v}$.


65. Absolute differential of a tensor. The preceding considerations
may be extended without difficulty to any field of tensors of order $q$.
Let us determine the differential of a tensor of the field, considered
as a vector of the corresponding tensor power of $E_n$, in terms of its
components referred to the natural frame $(M, \mathbf{e}_i)$. We shall call this the
absolute differential of the tensor. To simplify the working we shall
consider as an example a tensor $\mathbf{T}$ of order 2 defined by a system of
mixed components $t_i{}^j$. The components of the absolute differential
relative to $(M, \mathbf{e}_i)$ are written as $\nabla t_i{}^j$.

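Consider two arbitrary uniform vector fields $\mathbf{v}$ and $\mathbf{w}$ and form the
scalar $t_i{}^j\,v^i\,w_j$. The differential of this scalar, for an infinitesimal
variation of $M$, is given by

$$d(t_i{}^j\,v^i\,w_j) = (t_i{}^j + \nabla t_i{}^j)\,v^i\,w_j - t_i{}^j\,v^i\,w_j = \nabla t_i{}^j\,v^i\,w_j,$$

since the two vector fields are uniform.
On the other hand

$$d(t_i{}^j\,v^i\,w_j) = dt_i{}^j\,v^i\,w_j + t_i{}^j\,dv^i\,w_j + t_i{}^j\,v^i\,dw_j,$$

where, since the fields are uniform,

$$dv^i = -\omega_h{}^i\,v^h, \qquad dw_j = \omega_j{}^h\,w_h.$$

We conclude, on relabelling the summation indices, that

$$\nabla t_i{}^j\,v^i\,w_j = \left(dt_i{}^j - \omega_i{}^h\,t_h{}^j + \omega_h{}^j\,t_i{}^h\right)v^i\,w_j.$$

Since this equality holds regardless of the values of $v^i$ and $w_j$, it follows
that

$$\nabla t_i{}^j = dt_i{}^j - \omega_i{}^h\,t_h{}^j + \omega_h{}^j\,t_i{}^h. \tag{65.1}$$

The quantity $\nabla t_i{}^j$ can also be expressed as a linear differential form
with respect to the $dy^k$. Using (60.4) we get

$$\nabla t_i{}^j = \left(\partial_k t_i{}^j - \Gamma_k{}^h{}_i\,t_h{}^j + \Gamma_k{}^j{}_h\,t_i{}^h\right)dy^k,$$

which may be written as

$$\nabla t_i{}^j = \nabla_k t_i{}^j\,dy^k,$$

where

$$\nabla_k t_i{}^j = \partial_k t_i{}^j - \Gamma_k{}^h{}_i\,t_h{}^j + \Gamma_k{}^j{}_h\,t_i{}^h. \tag{65.2}$$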

It is clear that the quantities $\nabla_k t_i{}^j$ are the components of a tensor.
We call this the covariant derivative of $\mathbf{T}$. The general rules of composition
for the absolute differential and for the covariant derivative of a
tensor may be inferred immediately from (65.1) and (65.2).

The usual rules hold for the absolute differential of a sum or product
of tensors. Absolute differentiation and contraction are obviously
permutable operations in the sense that the result is independent of
the order in which they are carried out. It follows that contracted
products differentiate according to the same rule as general tensor
products. There is an important result due to Ricci concerning the
absolute differential of the fundamental tensor $g_{ij}$. This differential
is given by

$$\nabla g_{ij} = dg_{ij} - \omega_i{}^h\,g_{hj} - \omega_j{}^h\,g_{ih}.$$

Using the equations (61.2), which were used in the determination of
the $\omega_i{}^j$, we see that the right-hand side is identically zero.

THEOREM: The absolute differential of the fundamental tensor $g_{ij}$ is
zero.

It follows immediately that absolute differentiation and the raising
or lowering of indices are commuting operations.
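
Ricci's theorem can be verified directly in a simple case. The following sketch is an illustration under stated assumptions, not part of the text: it uses the plane polar metric $ds^2 = dr^2 + r^2\,d\theta^2$, for which the Christoffel symbols of the second kind are entered by hand, and evaluates $\nabla_k g_{ij} = \partial_k g_{ij} - \Gamma_k{}^h{}_i\,g_{hj} - \Gamma_k{}^h{}_j\,g_{ih}$ for every choice of indices. The function names are hypothetical.

```python
def gamma(k, j, i, r):
    """Christoffel symbol Gamma_k^j_i of plane polar coordinates (0 = r, 1 = theta)."""
    if (k, j, i) == (1, 0, 1):
        return -r                      # Gamma_theta^r_theta
    if (k, j, i) in ((0, 1, 1), (1, 1, 0)):
        return 1.0 / r                 # Gamma_r^theta_theta = Gamma_theta^theta_r
    return 0.0

def metric(r):
    """Covariant components g_ij of ds^2 = dr^2 + r^2 dtheta^2."""
    return [[1.0, 0.0], [0.0, r * r]]

def d_metric(k, r):
    """Exact partial derivatives d_k g_ij; only d_r g_{theta theta} = 2r is non-zero."""
    out = [[0.0, 0.0], [0.0, 0.0]]
    if k == 0:
        out[1][1] = 2.0 * r
    return out

def nabla_g(r):
    """All components of the covariant derivative of g_ij, keyed by (k, i, j)."""
    g = metric(r)
    result = {}
    for k in range(2):
        dg = d_metric(k, r)
        for i in range(2):
            for j in range(2):
                result[(k, i, j)] = (
                    dg[i][j]
                    - sum(gamma(k, h, i, r) * g[h][j] for h in range(2))
                    - sum(gamma(k, h, j, r) * g[i][h] for h in range(2))
                )
    return result
```

Calling `nabla_g(r)` for any positive `r` should return eight values that vanish to rounding error, which is the content of the theorem for this metric.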


66. Acceleration vector of a moving point. Let us consider a moving
point $M$ in the space $\mathcal{E}_n$ whose position is a function of a scalar
parameter $t$ which we shall regard initially as the time in order to be
able to use the terminology of elementary kinematics. Then the
curvilinear coordinates $(y^i)$ of $M$ are functions of $t$ and the velocity
vector of $M$ is given by

$$\mathbf{v} = \frac{d\mathbf{M}}{dt}.$$

Since the vector $d\mathbf{M}$ has contravariant components $dy^i$, the vector $\mathbf{v}$
has contravariant components

$$v^i = \frac{dy^i}{dt}. \tag{66.1}$$

The acceleration vector of $M$ is the derivative of the velocity vector

$$\boldsymbol{\gamma} = \frac{d\mathbf{v}}{dt}.$$

Since $d\mathbf{v}$ has contravariant components $\nabla v^i$, the vector $\boldsymbol{\gamma}$ has the
contravariant components given by

$$\gamma^i = \frac{\nabla v^i}{dt} = \frac{dv^i}{dt} + \Gamma_k{}^i{}_h\,v^h\,\frac{dy^k}{dt},$$

which becomes, using (66.1),

$$\gamma^i = \frac{d^2 y^i}{dt^2} + \Gamma_k{}^i{}_h\,\frac{dy^k}{dt}\,\frac{dy^h}{dt}. \tag{66.2}$$

If the acceleration of $M$ is always zero its trajectory is a straight
line in $\mathcal{E}_n$. Hence the straight lines of $\mathcal{E}_n$ are defined in terms of
curvilinear coordinates $(y^i)$ by the system of differential equations

$$\frac{d^2 y^i}{dt^2} + \Gamma_k{}^i{}_h\,\frac{dy^k}{dt}\,\frac{dy^h}{dt} = 0 \qquad (i = 1, \ldots, n). \tag{66.3}$$

A convenient choice for the independent variable $t$ is the distance $s$
of a point of the line measured along the line from a fixed origin. The
functions $y^i(s)$ therefore satisfy the differential equations

$$\frac{d^2 y^i}{ds^2} + \Gamma_k{}^i{}_h\,\frac{dy^k}{ds}\,\frac{dy^h}{ds} = 0$$

on a straight line in $\mathcal{E}_n$. Equation (66.3) was derived by making an
appeal to the existing ideas of elementary kinematics. More generally,
if $t$ represents any parameter it may be shown that the equation
represents straight lines in $\mathcal{E}_n$. This point will be taken up again in
§76.
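
The differential equations of the straight line can be tested numerically. The sketch below is an illustration, not part of the text: it assumes the Euclidean plane referred to polar coordinates $(r, \theta)$, for which the equations of zero acceleration reduce to $\ddot r - r\dot\theta^2 = 0$ and $\ddot\theta + (2/r)\,\dot r\,\dot\theta = 0$, integrates them with a classical fourth-order Runge-Kutta scheme, and converts the end point back to Cartesian coordinates. The function names, the step count and the initial data are assumptions made for the example.

```python
import math

def rhs(state):
    """Right-hand side of the first-order system for (r, theta, dr/dt, dtheta/dt)."""
    r, th, dr, dth = state
    # Zero-acceleration equations with Gamma_theta^r_theta = -r and
    # Gamma_r^theta_theta = Gamma_theta^theta_r = 1/r.
    return (dr, dth, r * dth * dth, -2.0 * dr * dth / r)

def rk4_step(state, dt):
    """One classical Runge-Kutta step of size dt."""
    def shifted(s, k, f):
        return tuple(a + f * b for a, b in zip(s, k))
    k1 = rhs(state)
    k2 = rhs(shifted(state, k1, dt / 2))
    k3 = rhs(shifted(state, k2, dt / 2))
    k4 = rhs(shifted(state, k3, dt))
    return tuple(s + dt / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def integrate(state, t_end, steps):
    """Advance the state from t = 0 to t = t_end in equal steps."""
    dt = t_end / steps
    for _ in range(steps):
        state = rk4_step(state, dt)
    return state

# Start at the Cartesian point (1, 0) with Cartesian velocity (0, 1),
# i.e. r = 1, theta = 0, dr/dt = 0, dtheta/dt = 1.
start = (1.0, 0.0, 0.0, 1.0)
r, th, _, _ = integrate(start, 1.0, 1000)
x, y = r * math.cos(th), r * math.sin(th)
```

With these initial data the exact trajectory is the line $x = 1$, $y = t$, so the computed end point at $t = 1$ should lie very close to $(1, 1)$ even though neither $r$ nor $\theta$ varies linearly with $t$.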


V. DIFFERENTIAL OPERATORS IN 
CURVILINEAR COORDINATES 


67. Gradient of a scalar function. Consider a scalar field defined by a
function $\phi$ of the curvilinear coordinates $y^i$ of the point $M$. The
absolute differential of this field reduces to the ordinary differential
$d\phi$ of the scalar $\phi$, this differential being itself a scalar. The corresponding
covariant derivatives are given by $\partial_i\phi$. It is easy to verify that the
$\partial_i\phi$ transform under a change of coordinates like the covariant components
of a vector. This vector is called the gradient of the function
$\phi$. We write

$$\operatorname{grad}_i \phi = \partial_i \phi. \tag{67.1}$$

The contravariant components of the gradient of $\phi$ are given by

$$\operatorname{grad}^i \phi = g^{ik}\,\partial_k \phi. \tag{67.2}$$

Beltrami has introduced the norm of the gradient given by

$$\Delta_1 \phi = g^{ij}\,\partial_i \phi\,\partial_j \phi, \tag{67.3}$$

which he calls the differential parameter of the first kind. In elementary
geometry, when the space is referred to Cartesian coordinates, this is
given by

$$\Delta_1 \phi = \left(\frac{\partial \phi}{\partial x}\right)^2 + \left(\frac{\partial \phi}{\partial y}\right)^2 + \left(\frac{\partial \phi}{\partial z}\right)^2.$$

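68. Curl of a vector field. Consider a vector field $\mathbf{v}$ with covariant
components $v_i$. We have seen that

$$\nabla_j v_i = \partial_j v_i - \Gamma_j{}^h{}_i\,v_h. \tag{68.1}$$

Since the $\Gamma$'s are symmetric in their lower indices we can interchange
the indices $i$ and $j$ in (68.1) to get

$$\nabla_i v_j = \partial_i v_j - \Gamma_i{}^h{}_j\,v_h. \tag{68.2}$$

Subtracting (68.1) from (68.2) we find

$$\nabla_i v_j - \nabla_j v_i = \partial_i v_j - \partial_j v_i.$$

It follows that the quantities

$$\partial_i v_j - \partial_j v_i$$

are the covariant components of an antisymmetric tensor. This tensor
is called the curl (or rotor or rotation) of the vector $\mathbf{v}$. We write

$$\operatorname{curl}_{ij}\mathbf{v} = \partial_i v_j - \partial_j v_i. \tag{68.3}$$

In elementary geometry the vector adjoint of the antisymmetric
tensor (68.3) is called the curl vector.


69. Divergence of a vector field. Consider a vector field $\mathbf{v}$ with
contravariant components $v^i$. We call the scalar

$$\operatorname{div}\mathbf{v} = \nabla_i v^i$$

the divergence of the vector $\mathbf{v}$. Since

$$\nabla_i v^i = \partial_i v^i + \Gamma_i{}^i{}_h\,v^h,$$

we have

$$\operatorname{div}\mathbf{v} = \partial_i v^i + \Gamma_i{}^i{}_h\,v^h. \tag{69.1}$$

This formula may be rewritten using a simplified expression for the
quantities $\Gamma_i{}^i{}_h$. Ricci's theorem (§65) relating to the fundamental
tensor may be written in covariant form as

$$\nabla_h g_{ij} = \partial_h g_{ij} - \Gamma_h{}^k{}_i\,g_{kj} - \Gamma_h{}^k{}_j\,g_{ik} = 0.$$

Contracted multiplication of this by $g^{ij}$ gives

$$g^{ij}\,\partial_h g_{ij} - \Gamma_h{}^i{}_i - \Gamma_h{}^j{}_j = 0,$$

or, since the $\Gamma$'s are symmetric in their lower indices,

$$\Gamma_i{}^i{}_h = \tfrac{1}{2}\,g^{ij}\,\partial_h g_{ij}. \tag{69.2}$$

The form of the right-hand side of (69.2) brings to mind the
derivative of the determinant $g$. If $\alpha^{ij}$ denotes the cofactor of the
element $g_{ij}$ in $g$ we have

$$\partial_h g = \alpha^{ij}\,\partial_h g_{ij} = g\,g^{ij}\,\partial_h g_{ij}.$$

It follows that

$$\Gamma_i{}^i{}_h = \frac{1}{2}\,\frac{\partial_h g}{g} = \frac{\partial_h \sqrt{|g|}}{\sqrt{|g|}}. \tag{69.3}$$

Substituting (69.3) in (69.1) we obtain

$$\operatorname{div}\mathbf{v} = \frac{1}{\sqrt{|g|}}\,\partial_i\!\left(\sqrt{|g|}\,v^i\right). \tag{69.4}$$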
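
Formula (69.4) can be illustrated in the Euclidean plane referred to polar coordinates $(r, \theta)$, for which $g = r^2$. The worked check below is a sketch added for illustration; the field chosen, with natural-frame components $v^r = r$, $v^\theta = 0$, is the position vector field, whose Cartesian components are $(x, y)$.

```latex
% (69.4) with \sqrt{|g|} = r:
\operatorname{div}\mathbf{v}
  = \frac{1}{r}\left[\partial_r\!\left(r\,v^r\right) + \partial_\theta\!\left(r\,v^\theta\right)\right]
  = \frac{1}{r}\,\partial_r\!\left(r\,v^r\right) + \partial_\theta v^\theta .

% For v^r = r, v^\theta = 0:
\operatorname{div}\mathbf{v} = \frac{1}{r}\,\partial_r\!\left(r^2\right) = 2,
% in agreement with the Cartesian value \partial_x x + \partial_y y = 2.
```

Note that $v^\theta$ here is the component along the natural frame vector of length $r$, not the component along a unit vector, which is why no factor $1/r$ multiplies $\partial_\theta v^\theta$.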
In curvilinear coordinates the following integral over an $n$-dimensional
domain Z may be used to introduce the multiple integral of
order $n$:

$$\int_Z \operatorname{div}\mathbf{v}\;dV
= \int_Z \frac{1}{\sqrt{|g|}}\,\partial_i\!\left(\sqrt{|g|}\,v^i\right)\sqrt{|g|}\;dy^1 \cdots dy^n
= \int_Z \partial_i\!\left(\sqrt{|g|}\,v^i\right) dy^1 \cdots dy^n.$$

The last expression can be transformed into an integral over the
boundary of Z which is just the flux of the vector field $\mathbf{v}$ passing
through this boundary.


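70. The Laplacian of a function. In the space of elementary geometry
referred to a Cartesian coordinate system $(x, y, z)$ the expression

$$\Delta_2 \phi = \frac{\partial^2 \phi}{\partial x^2} + \frac{\partial^2 \phi}{\partial y^2} + \frac{\partial^2 \phi}{\partial z^2} = \operatorname{div}\operatorname{grad}\phi$$

is often of interest. It is called the Laplacian of the function $\phi$ (or the
Beltrami differential parameter of the second kind).

More generally, given a scalar function $\phi$ of the variables $y^i$ in $\mathcal{E}_n$, we
call

$$\Delta_2 \phi = \operatorname{div}\operatorname{grad}\phi \tag{70.1}$$

the Laplacian of $\phi$. The preceding considerations enable us to determine
an expression for the Laplacian of $\phi$ in terms of any system of
curvilinear coordinates. Using Ricci's theorem

$$\Delta_2 \phi = \nabla_i\!\left(g^{ij}\,\partial_j \phi\right) = g^{ij}\,\nabla_i\!\left(\partial_j \phi\right).$$

It follows that

$$\Delta_2 \phi = g^{ij}\left(\partial_i \partial_j \phi - \Gamma_i{}^h{}_j\,\partial_h \phi\right). \tag{70.2}$$

Using the expression (69.4) for the divergence we deduce the
following alternative form of the Laplacian which is often convenient
in practical calculations:

$$\Delta_2 \phi = \frac{1}{\sqrt{|g|}}\,\partial_i\!\left(\sqrt{|g|}\,g^{ij}\,\partial_j \phi\right). \tag{70.3}$$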
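
The form (70.3) is easily applied to the Euclidean plane referred to polar coordinates $(r, \theta)$. The worked example below is a sketch added for illustration; it assumes the metric $ds^2 = dr^2 + r^2\,d\theta^2$, so that $g^{rr} = 1$, $g^{\theta\theta} = 1/r^2$ and $\sqrt{|g|} = r$, and tests the result on the function $\phi = r^2$.

```latex
% (70.3) for ds^2 = dr^2 + r^2 d\theta^2:
\Delta_2 \phi
  = \frac{1}{r}\left[\partial_r\!\left(r\,\partial_r \phi\right)
      + \partial_\theta\!\left(r \cdot \frac{1}{r^2}\,\partial_\theta \phi\right)\right]
  = \frac{1}{r}\,\partial_r\!\left(r\,\partial_r \phi\right)
      + \frac{1}{r^2}\,\partial_\theta \partial_\theta \phi .

% Check with \phi = r^2 = x^2 + y^2:
\Delta_2 \phi = \frac{1}{r}\,\partial_r\!\left(2 r^2\right) = 4,
% in agreement with \partial_x^2 (x^2 + y^2) + \partial_y^2 (x^2 + y^2) = 2 + 2 .
```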


CHAPTER V 


Riemannian Spaces 


I. TANGENTIAL AND OSCULATING 
EUCLIDEAN METRICS 


71. Definition of Riemannian spaces. Let us consider an $n$-dimensional
point continuum $V_n$ which is many times differentiable. The best way
to visualize such a continuum is by imagining that it corresponds to a
dynamical system with $n$ degrees of freedom. We assume that the
immediate neighbourhood of each point $M_0$ of $V_n$ can be represented
by a set of $n$ coordinates $y^i$ capable of assuming all values in the
neighbourhood of the $y_0^i$ which are the coordinates of $M_0$. These
coordinates $y^i$, which serve to represent analytically a certain part of
$V_n$, can obviously be chosen in an infinite number of ways. We shall
arrange, in transforming from one system of coordinates $(y^i)$ to
another $(y^{j'})$, that the new coordinates are continuously differentiable
functions of the old (to a sufficient order), and conversely.

Let us associate the continuum $V_n$ with the metric given by

$$ds^2 = g_{ij}\,dy^i\,dy^j, \tag{71.1}$$

where the coefficients $g_{ij}$ are arbitrary functions of the $y^i$, subject only
to the condition of being continuously differentiable to a sufficiently
high order. If this metric does not satisfy the integrability conditions
given in §61, curvilinear coordinate systems cannot exist in $\mathcal{E}_n$ such
that the metric of $\mathcal{E}_n$ takes the form (71.1). In this case we say that the
metric (71.1) is non-Euclidean and that it defines $V_n$ as a Riemannian
space.

A Riemannian space is thus simply an $n$-dimensional continuum
with an arbitrary metric. Such a space is said to be properly Riemannian
if the metric is positive definite. The expression (71.1) can, quite
generally, be put into the form of an algebraic sum of $n$ squares of
linear differential forms. The set of signs ($+$ or $-$) which precede the
squared terms in this sum is called the signature of the form.


72. Tangential Euclidean metric at a point. The simplest way of describing
the geometrical properties of a Riemannian space is to identify
it locally, as far as possible, with a Euclidean space $\mathcal{E}_n$. To this end we
introduce the concept of a tangential Euclidean metric at a point
$M_0$ of $V_n$.

Consider a Euclidean space $\mathcal{E}_n$ with the same signature as $V_n$ and a
point $M_0$ of the Riemannian space with coordinates $(y_0^i)$. Suppose the
point $M_0$ to correspond to a point $m_0$ of $\mathcal{E}_n$ with a natural frame $(m_0, \mathbf{e}_i)$
which is subject only to the conditions

$$\mathbf{e}_i \cdot \mathbf{e}_j = (g_{ij})_0, \tag{72.1}$$

where $(g_{ij})_0$ designates the value at $M_0$ of the coefficients $g_{ij}$ of (71.1).

Suppose that each point $M$ in the vicinity of $M_0$ in $V_n$ is brought
into correspondence with a point $m$ in the vicinity of $m_0$ in $\mathcal{E}_n$ in the
following way: if $M$ has the coordinates $(y^i)$ then $m$ is defined by

$$\overrightarrow{m_0 m} = \left[(y^i - y_0^i) + \psi^i_{(2)}(y^k - y_0^k)\right]\mathbf{e}_i, \tag{72.2}$$

where the functions $\psi^i_{(2)}$ are restricted to be at least of second order
with respect to the variables $(y^k - y_0^k)$. We then say that the correspondence
defines a first-order representation of the vicinity of $M_0$. The
point $m$ is said to be the image of $M$ in the representation, $m_0$ being
naturally the image of $M_0$.

According to the formula (72.2) the point $m$ is defined as a function
of the $n$ scalar variables $(y^i)$. It follows that the $(y^i)$ constitute a system
of curvilinear coordinates in the Euclidean space $\mathcal{E}_n$ in the neighbourhood
of $m_0$. This system of curvilinear coordinates has the natural
frame at $m_0$ defined by the vectors

$$\left(\frac{\partial m}{\partial y^i}\right)_0 = \mathbf{e}_i,$$

which coincide with the frame $(m_0, \mathbf{e}_i)$ given initially.
It follows from (72.1), if the metric of the Euclidean space $\mathcal{E}_n$ in the
coordinate system $(y^i)$ is designated by

$$d\bar{s}^2 = \bar{g}_{ij}\,dy^i\,dy^j, \tag{72.3}$$

then for $y^i = y_0^i$, we have

$$(\bar{g}_{ij})_0 = \mathbf{e}_i \cdot \mathbf{e}_j = (g_{ij})_0. \tag{72.4}$$

The metrics (71.1) and (72.3) thus have the same coefficients for
$y^i = y_0^i$ and are said to be tangential at $y^i = y_0^i$.

Let us make a general transformation of the $(y^i)$ to new coordinates
$(y^{j'})$ and let $(m_0, \mathbf{e}_{j'})$ be the natural frame at $m_0$ for the coordinates
$(y^{j'})$. It follows at once that the vector $\overrightarrow{m_0 m}$ defined by (72.2) can also
be written as

$$\overrightarrow{m_0 m} = \left[(y^{j'} - y_0^{j'}) + \Omega^{j'}_{(2)}(y^{k'} - y_0^{k'})\right]\mathbf{e}_{j'},$$

where the functions $\Omega$ satisfy the same condition as the $\psi$. It is now
apparent that the concept of first-order representation is independent
of the system of variables employed or, as we shall say, it has an
intrinsic character.

In such a coordinate transformation the $(\bar{g}_{ij})_0$ change according to
the relations

$$(\bar{g}_{i'j'})_0 = (A_{i'}^{k})_0\,(A_{j'}^{l})_0\,(\bar{g}_{kl})_0, \qquad \text{where } A_{i'}^{k} = \frac{\partial y^k}{\partial y^{i'}}.$$

In order that the concept of tangential Euclidean metric shall have an
intrinsic character it is necessary and sufficient that

$$(g_{i'j'})_0 = (A_{i'}^{k})_0\,(A_{j'}^{l})_0\,(g_{kl})_0.$$

We are therefore led to adopt the convention that, in any coordinate
transformation, the coefficients $g_{ij}$ of the metric of a Riemannian
space transform according to the relations

$$g_{i'j'} = A_{i'}^{k}\,A_{j'}^{l}\,g_{kl}. \tag{72.5}$$

Since the concepts of first-order representation and of tangential
Euclidean metric at a point have an intrinsic character we can extend
to Riemannian spaces some geometrical ideas which are Euclidean in
origin.


73. Geometrical ideas derived from tangential Euclidean metrics.
(1) Consider a point $M_0$ in the Riemannian space $V_n$ with the metric

$$ds^2 = g_{ij}\,dy^i\,dy^j. \tag{73.1}$$

Let $\mathcal{E}_n$ be the tangential Euclidean space at $M_0$ which we suppose to
be referred to the natural frame $(m_0, \mathbf{e}_i)$ at $m_0$ associated with the
system of curvilinear coordinates $(y^i)$. The metric of $\mathcal{E}_n$ at $m_0$ is given
by the quadratic form

$$(g_{ij})_0\,dy^i\,dy^j.$$

The tensors which have components referred to the frame $(m_0, \mathbf{e}_i)$
in $\mathcal{E}_n$ will be said to define tensors at the point $M_0$ of $V_n$ with respect
to the coordinate system $(y^i)$. On transforming from the coordinates
$(y^i)$ to the $(y^{j'})$ the frame $(m_0, \mathbf{e}_i)$ is replaced by $(m_0, \mathbf{e}_{j'})$ where

$$\mathbf{e}_{j'} = (A_{j'}^{i})_0\,\mathbf{e}_i, \qquad \mathbf{e}_i = (A_{i}^{j'})_0\,\mathbf{e}_{j'}, \tag{73.2}$$

and the components $t^i{}_{jk}$, for example, of a third-order tensor transform
according to the usual rule

$$t^{i'}{}_{j'k'} = (A_{l}^{i'})_0\,(A_{j'}^{m})_0\,(A_{k'}^{r})_0\;t^{l}{}_{mr}. \tag{73.3}$$

The components of different kinds of a tensor $\mathbf{T}$ can be derived from
one another by contracted multiplication with the quantities $(g_{ij})_0$ and
$(g^{ij})_0$.

If, for the moment, we designate the point chosen in $V_n$ by $M$ we
may omit the index zero from the above formulae. In this way the
complete Euclidean tensor algebra can be extended without modification
to vectors and tensors associated with a single point $M$ of $V_n$. The
algebra of these tensors is simply carried over from the tensor
algebra of the Euclidean space $\mathcal{E}_n$ by means of the tangential Euclidean
metric. In particular, denoting two vectors associated with $M$ by $\mathbf{v}$ and
$\mathbf{w}$, and their components by $v^i$ and $w^j$ respectively, their scalar product
is given by

$$\mathbf{v} \cdot \mathbf{w} = g_{ij}\,v^i\,w^j.$$

(2) The introduction of tangential Euclidean metrics at a point
also allows us to define, in Riemannian geometry, certain ideas
involving $1$-, $\ldots$, $n$-dimensional domains of $V_n$.
Suppose, for example, that the space $V_n$ is properly Riemannian.
Using the same notation as before the infinitesimal distance between
two points $M_0$, $M$ in the Riemannian space is equal to the infinitesimal
distance in Euclidean space between the two image points:

$$\overline{m_0 m}^{\,2} = (\bar{g}_{ij})_0\,dy^i\,dy^j = (g_{ij})_0\,dy^i\,dy^j = \overline{M_0 M}^{\,2}.$$

Using the infinitesimal distance between two points in the Riemannian
space given by

$$ds = \sqrt{g_{ij}\,dy^i\,dy^j}$$

we may derive by integration the length of a finite arc $AB$ by
a formula identical with (58.3), i.e.

$$\widehat{AB} = \int_a^b \sqrt{g_{ij}\,\frac{dy^i}{dt}\,\frac{dy^j}{dt}}\;dt.$$

Consider an element of volume at the origin $M_0$. This corresponds
to the volume element

$$dV = \sqrt{|g_0|}\;dy^1 \cdots dy^n$$

in the Euclidean space. We are thus led to adopt

$$dV = \sqrt{|g|}\;dy^1 \cdots dy^n$$

as the expression for the volume element in the Riemannian space.
The volume $V$ of an $n$-dimensional domain will therefore be given by
the integral

$$V = \int \cdots \int \sqrt{|g|}\;dy^1 \cdots dy^n.$$

Analogous results could be given for various kinds of area.

It should be stressed that the notion of the tangential Euclidean
metric does not permit us to interrelate tensors associated with
different points of Riemannian space, even neighbouring ones.
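
The volume integral just obtained can be tried on a familiar properly Riemannian space. The sketch below is an illustration under stated assumptions, not part of the text: it takes the surface of a sphere of radius $R$ as a two-dimensional $V_2$ with the metric $ds^2 = R^2\,d\theta^2 + R^2\sin^2\theta\,d\varphi^2$, and evaluates $\int\!\!\int \sqrt{|g|}\,d\theta\,d\varphi$ by the midpoint rule. The function names and grid sizes are hypothetical.

```python
import math

def sqrt_abs_g(theta, phi, radius=1.0):
    """sqrt|g| for ds^2 = R^2 dtheta^2 + R^2 sin^2(theta) dphi^2 (diagonal metric)."""
    g = (radius ** 2) * (radius ** 2) * math.sin(theta) ** 2
    return math.sqrt(abs(g))

def area(radius=1.0, n_theta=400, n_phi=400):
    """Midpoint-rule estimate of the integral of sqrt|g| over 0<theta<pi, 0<phi<2pi."""
    d_theta = math.pi / n_theta
    d_phi = 2 * math.pi / n_phi
    total = 0.0
    for a in range(n_theta):
        theta = (a + 0.5) * d_theta
        for b in range(n_phi):
            phi = (b + 0.5) * d_phi
            total += sqrt_abs_g(theta, phi, radius) * d_theta * d_phi
    return total
```

For the unit sphere `area()` should approach $4\pi$ as the grid is refined, the two-dimensional "volume" of $V_2$ being its ordinary surface area.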


74. Osculating Euclidean metric at a point. Using the same notation
as in §72, let us take a first-order representation and consider the
point $m$, which is the image of $M$, determined by the curvilinear
coordinates $y^i$. From (61.7) we have

$$\frac{\partial^2 m}{\partial y^j\,\partial y^k} = \bar{\Gamma}_j{}^i{}_k\,\mathbf{e}_i, \tag{74.1}$$

where the Christoffel symbols on the right-hand side are constructed
from the $\bar{g}_{ij}$ of the Euclidean metric. This leads us to make the following
definition:

We say that a representation for the vicinity of $M_0$ is of second
order if it is of first order and if the formula (72.2) is replaced by

$$\overrightarrow{m_0 m} = \left[(y^i - y_0^i) + \tfrac{1}{2}\left\{{}_{j}{}^{i}{}_{k}\right\}_0 (y^j - y_0^j)(y^k - y_0^k) + \psi^i_{(3)}(y^h - y_0^h)\right]\mathbf{e}_i, \tag{74.2}$$

where the Christoffel symbols of the right-hand side are constructed
from the $g_{ij}$ and evaluated for $y^i = y_0^i$. The functions $\psi^i_{(3)}$ are restricted
to be at least of third order in the variables $(y^h - y_0^h)$ in the vicinity of
$(y^h - y_0^h) = 0$.

Consider the metric

$$d\bar{s}^2 = \bar{g}_{ij}\,dy^i\,dy^j \tag{74.3}$$

of the Euclidean space $\mathcal{E}_n$ referred to the system of curvilinear coordinates
$(y^i)$ defined by means of a second-order representation. Since the
representation is already of first order we have

$$\left(\frac{\partial m}{\partial y^i}\right)_0 = \mathbf{e}_i$$

and consequently

$$(\bar{g}_{ij})_0 = (g_{ij})_0.$$

It also follows, on differentiating (74.2), that

$$\left(\frac{\partial^2 m}{\partial y^j\,\partial y^k}\right)_0 = \left\{{}_{j}{}^{i}{}_{k}\right\}_0\,\mathbf{e}_i.$$

Comparing this result with (74.1) evaluated at $y^i = y_0^i$ we obtain

$$\left(\bar{\Gamma}_j{}^i{}_k\right)_0 = \left\{{}_{j}{}^{i}{}_{k}\right\}_0.$$



The Christoffel symbols of the first kind are therefore equal at 
γ᾽ = γῇ and, from (62.1), 


(2x8 yo = (2x 2y)o- (74.4) 


We have just seen that the Riemannian metric and the Euclidean 
metric (74.3) have the same coefficients and the same derivatives of 
these coefficients at $y^i = y^i_0$. We can also show, as in the previous case,
that the concepts of second-order representation and of the osculating 
metric have an intrinsic character. These concepts will be used to 
define the absolute differential of a vector or tensor in Riemannian 
space. 

In order to gain a more concrete idea of this representation let us
consider a simple example. Locally curved surfaces set in Euclidean
space can be regarded as two-dimensional Riemannian spaces. Let S
be a surface, $M_0$ a point on S and $\mathcal{E}_2$ the tangent plane at $M_0$. Let
each point M of S in the vicinity of $M_0$ be put in correspondence
with the point m which is the orthogonal projection of M on $\mathcal{E}_2$. The
correspondence between M and m defines a representation of second
order; the metric of $\mathcal{E}_2$ osculates with that of the surface at $M_0$.
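
The surface example can be checked numerically. The sketch below assumes a particular surface z = z(x, y) tangent to the plane z = 0 at the origin (the coefficients are arbitrary illustrative values), uses the projected coordinates (x, y) on the tangent plane, and verifies that the induced metric $g_{ij} = \delta_{ij} + \partial_i z\,\partial_j z$ agrees with the plane metric $\delta_{ij}$ in its values and in its first derivatives at the origin. All names are hypothetical.

```python
def height(x, y):
    """Hypothetical surface z = z(x, y), tangent to the plane z = 0 at the origin."""
    return 0.65 * x * x + 0.4 * x * y - 0.35 * y * y

def induced_metric(x, y, h=1e-6):
    """g_ij = delta_ij + (d_i z)(d_j z) in the projected coordinates (x, y)."""
    zx = (height(x + h, y) - height(x - h, y)) / (2 * h)
    zy = (height(x, y + h) - height(x, y - h)) / (2 * h)
    return [[1.0 + zx * zx, zx * zy], [zx * zy, 1.0 + zy * zy]]

def metric_derivatives_at_origin(h=1e-4):
    """d[k][i][j] = d_k g_ij at the origin, estimated by central differences."""
    out = []
    for dx, dy in [(h, 0.0), (0.0, h)]:
        gp = induced_metric(dx, dy)
        gm = induced_metric(-dx, -dy)
        out.append([[(gp[i][j] - gm[i][j]) / (2 * h) for j in range(2)]
                    for i in range(2)])
    return out
```

Both the deviation of `induced_metric(0.0, 0.0)` from the unit matrix and every entry of `metric_derivatives_at_origin()` should vanish to rounding accuracy, which is the osculation property in this special case.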


75. Tensor fields in $V_n$. Absolute differentiation. We associate a tensor
T with each point M of $V_n$ in the following manner. Each point M is
associated with a frame compatible with the metric at that point; the
components of the tensor T can then be obtained with respect to such
a frame. If, for each point M, this set of components is given as a
function of the coordinates $(y^i)$, we say that we have a tensor field
in $V_n$. The $g_{ij}$ constitute such a tensor field.

In order to compare tensors associated with two neighbouring
points $M_0$ and M of $V_n$ it is necessary to inter-relate the corresponding
frames in $\mathcal{E}_n$. To this end we use a second-order representation in
$\mathcal{E}_n$ in the neighbourhood of $M_0$ and adopt the natural frames
associated with the points $m_0$ and m which are images of $M_0$ and M. The
tensor associated with the point M is represented by the image tensor
defined by the components referred to the natural frame at m.

When we change the second-order representation, the functions
$\Psi^i_{(3)}$ are modified and the point m is replaced by a point which differs
from it only by a third-order infinitesimal vector, the $(y^i - y^i_0)$ being
the first-order infinitesimals. The vectors of the natural frame at m are
modified by vectors which are second-order infinitesimals or less. It
follows that the image tensor of the tensor associated with the point
M is defined to the same order of approximation.

Let $T_0$ and T be two tensors associated with the points $M_0$ and M
respectively. They have, in any second-order representation, two
image tensors whose difference is defined independently of the
particular representation considered to the second order in the
infinitesimals or less. The principal part of this difference between the
image tensors is called the absolute differential of the tensor T.

Consider, for example, a field of vectors in $V_n$ defined by their
contravariant components $v^i$. In a second-order representation the vector
field has image vectors, corresponding to the points $M_0$ and M, whose
difference has contravariant components with respect to the natural
frame at $m_0$ given by

$$ (\nabla v^i)_0 = (dv^i)_0 + (\omega^i{}_h)_0\,v^h_0, $$

where

$$ (\omega^i{}_h)_0 = (\bar{\Gamma}_h{}^i{}_k)_0\,dy^k. \qquad (75.1) $$

The Christoffel symbols on the right-hand side of this relation are
those of the osculating Euclidean metric and are therefore equal to the
Christoffel symbols of the Riemannian metric at $y^i = y^i_0$.

Suppressing the index 0, we see that since

$$ \omega^i{}_h = \left\{ {i \atop h\;k} \right\} dy^k = \Gamma_h{}^i{}_k\,dy^k, \qquad (75.2) $$

then the absolute differential of the vector v at M has the
contravariant components

$$ \nabla v^i = dv^i + \omega^i{}_h\,v^h. \qquad (75.3) $$

As in Euclidean geometry the quantities

$$ \nabla_k v^i = \partial_k v^i + \Gamma_h{}^i{}_k\,v^h \qquad (75.4) $$

(where the $\Gamma_h{}^i{}_k$ are as defined above) are the components of a tensor
called the covariant derivative of the vector v. If the vector field is
defined by the covariant components $v_i$, its absolute differential has the
covariant components

$$ \nabla v_i = dv_i - \omega^h{}_i\,v_h, $$

and the corresponding components of the covariant derivative are
given by

$$ \nabla_k v_i = \partial_k v_i - \Gamma_i{}^h{}_k\,v_h. $$


The vectors at two neighbouring points M and M’ are said to be 
identical if their image vectors in a second-order representation are 
identical; the absolute differential $\nabla v^i$ corresponding to the passage
from the first vector to the second is then zero.

The formulae which define the absolute differential or the co- 
variant derivative of any tensor may also be extended to Riemannian 
geometry using arguments similar to those above. 
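
A small numerical sketch may make the covariant derivative more concrete. It assumes the Euclidean plane in polar coordinates $(r, \theta)$, whose only non-zero Christoffel symbols are $\Gamma_\theta{}^r{}_\theta = -r$ and $\Gamma_r{}^\theta{}_\theta = \Gamma_\theta{}^\theta{}_r = 1/r$, and a vector field that is constant in Cartesian terms. Its polar components vary from point to point, yet all components $\nabla_k v^i = \partial_k v^i + \Gamma_h{}^i{}_k v^h$ should vanish. The function names and the finite-difference step are illustrative assumptions.

```python
import math

def christoffel_polar(r):
    """gam[h][i][k] = Gamma_h^i_k for ds^2 = dr^2 + r^2 dtheta^2 (index 0 = r, 1 = theta)."""
    gam = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
    gam[1][0][1] = -r        # Gamma_theta^r_theta
    gam[0][1][1] = 1.0 / r   # Gamma_r^theta_theta
    gam[1][1][0] = 1.0 / r   # Gamma_theta^theta_r
    return gam

def constant_field(y):
    """Contravariant polar components of the Cartesian unit vector along x."""
    r, theta = y
    return [math.cos(theta), -math.sin(theta) / r]

def covariant_derivative(field, y, h=1e-5):
    """nabla[k][i] = d_k v^i + Gamma_h^i_k v^h, partial derivatives by central differences."""
    n = len(y)
    gam = christoffel_polar(y[0])
    v = field(y)
    nabla = []
    for k in range(n):
        yp, ym = list(y), list(y)
        yp[k] += h
        ym[k] -= h
        vp, vm = field(yp), field(ym)
        nabla.append([(vp[i] - vm[i]) / (2 * h)
                      + sum(gam[hh][i][k] * v[hh] for hh in range(n))
                      for i in range(n)])
    return nabla
```

Calling `covariant_derivative(constant_field, [1.7, 0.4])` should return four entries that are zero to within the finite-difference error, although the partial derivatives alone are not.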

To summarize, the concepts of second-order representation and of 
the osculating Euclidean metric permit the extension to Riemannian 
spaces of the Euclidean tensor analysis relating to tensors associated 
with neighbouring points. This holds for all the differential operators 
we have studied. It is important to note, however, in the case of
vectors (for example), that whereas the absolute differential in
Euclidean geometry is an exact differential satisfying the usual
integrability conditions, there is no reason to suppose that this will
also be true in Riemannian geometry.


76. Acceleration vector of a moving point in $V_n$. Geodesics. In §66 we
considered the motion of a point in Euclidean space. Let us now
consider a moving point M in $V_n$ whose position is a function of a
parameter t which we shall interpret as the time. The velocity vector of M
has the contravariant components

$$ v^i = \frac{dy^i}{dt} $$

and its acceleration vector $\boldsymbol{\gamma}$ has the components

$$ \gamma^i = \frac{\nabla v^i}{dt} = \frac{d^2 y^i}{dt^2} + \Gamma_h{}^i{}_k\,\frac{dy^h}{dt}\,\frac{dy^k}{dt}. \qquad (76.1) $$

If the acceleration of M is zero its trajectory is called a geodesic of
the Riemannian space $V_n$. Generally geodesics are defined
parametrically, for any t, by the solutions of the system of differential
equations

$$ \frac{d^2 y^i}{dt^2} + \Gamma_h{}^i{}_k\,\frac{dy^h}{dt}\,\frac{dy^k}{dt} = 0 \qquad (i = 1, 2, \ldots, n). \qquad (76.2) $$


Take as independent variable along a geodesic C the distance s of a
point M, measured along C from some fixed origin. The vector u
with components

$$ u^i = \frac{dy^i}{ds} $$

is thus the unit vector collinear with dM and tangential to the curve.
The geodesics are characterized by

$$ \frac{\nabla u^i}{ds} = \frac{dy^k}{ds}\,\nabla_k u^i = u^k\,\nabla_k u^i = 0. \qquad (76.3) $$


These equations show that transport along C between neighbouring 
points leaves the vector u unchanged. Geodesics are therefore the 
analogues in Riemannian geometry of straight lines in Euclidean space. 

However, in $\mathcal{E}_n$ straight lines are characterized by the following
property: a straight line between two points A and B corresponds to
an extremum of the length of arc AB with respect to the lengths of arc
of all curves joining A and B. We may ask whether this is also true for
the geodesics of Riemannian geometry. For simplicity we shall study
a proper Riemannian space.

Consider, in terms of some parametric representation, an arc
joining two points A and B of $V_n$. If a and b represent the values of the
parameter t at A and B, then from §73 the length of arc AB is given by
the integral

$$ \int_a^b \sqrt{f(y^i, \dot{y}^i)}\;dt, $$

where

$$ f = g_{ij}\,\dot{y}^i\,\dot{y}^j, \qquad \dot{y}^i = \frac{dy^i}{dt}. $$
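The curves representing the extrema of this integral, the extremals,
are defined by the appropriate system of Euler's equations in the
calculus of variations,

$$ \frac{d}{dt}\left( \frac{\partial \sqrt{f}}{\partial \dot{y}^i} \right) - \frac{\partial \sqrt{f}}{\partial y^i} = 0 \qquad (i = 1, 2, \ldots, n). $$

Carrying out the differentiations we obtain

$$ \frac{d}{dt}\left( \frac{1}{\sqrt{f}}\,\frac{\partial f}{\partial \dot{y}^i} \right) - \frac{1}{\sqrt{f}}\,\frac{\partial f}{\partial y^i} = 0 $$

or

$$ \frac{d}{dt}\left( \frac{\partial f}{\partial \dot{y}^i} \right) - \frac{\partial f}{\partial y^i} - \frac{1}{2f}\,\frac{df}{dt}\,\frac{\partial f}{\partial \dot{y}^i} = 0. \qquad (76.4) $$

Since the parameter t is still arbitrary we take this to be the distance
s along the curves considered. Under these conditions

$$ \dot{y}^i = \frac{dy^i}{ds}, \qquad f = g_{ij}\,\dot{y}^i\,\dot{y}^j = 1 \qquad (76.5) $$

and hence the system of differential equations (76.4) reduces to

$$ \frac{d}{ds}\left( \frac{\partial f}{\partial \dot{y}^i} \right) - \frac{\partial f}{\partial y^i} = 0, \qquad (76.6) $$

which are Euler's equations for the function f. We deduce from
(76.5) that

$$ \frac{\partial f}{\partial \dot{y}^i} = 2 g_{ij}\,\dot{y}^j, \qquad \frac{\partial f}{\partial y^i} = \partial_i g_{jk}\,\dot{y}^j \dot{y}^k. $$

Hence (76.6) takes, after division by 2, the explicit form

$$ \frac{d}{ds}\left( g_{ij}\,\dot{y}^j \right) - \tfrac{1}{2}\,\partial_i g_{jk}\,\dot{y}^j \dot{y}^k = g_{ij}\,\frac{d\dot{y}^j}{ds} + \left( \partial_k g_{ij} - \tfrac{1}{2}\,\partial_i g_{jk} \right) \dot{y}^j \dot{y}^k = 0 \qquad (76.7) $$

and, on introducing the Christoffel symbols of the first kind,

$$ g_{ij}\,\frac{d\dot{y}^j}{ds} + [jk, i]\,\dot{y}^j \dot{y}^k = 0. \qquad (76.8) $$

After contracted multiplication by $g^{ih}$ this becomes

$$ \frac{d^2 y^h}{ds^2} + \Gamma_j{}^h{}_k\,\frac{dy^j}{ds}\,\frac{dy^k}{ds} = 0, $$

which coincides with the geodesic equation (76.2) when t = s. We can
therefore formulate the following THEOREM: In Riemannian
geometry, geodesics are the curves defined by the extremals of the integral
representing the length of arc of any curve connecting two fixed points
of $V_n$.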
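
The geodesic equation (76.2) can be explored numerically. The sketch below assumes the unit sphere with coordinates $(\theta, \varphi)$, for which the non-zero Christoffel symbols are $-\sin\theta\cos\theta$ (upper index $\theta$, lower indices $\varphi\varphi$) and $\cot\theta$ (upper index $\varphi$, lower indices $\theta\varphi$). It integrates the equation with a classical Runge-Kutta step; the sphere, the integrator and all names are choices made for this illustration only.

```python
import math

def geodesic_rhs(state):
    """Right-hand side of (76.2) on the unit sphere; state = (theta, phi, theta', phi')."""
    theta, phi, d_theta, d_phi = state
    gamma_theta_phiphi = -math.sin(theta) * math.cos(theta)
    gamma_phi_thetaphi = math.cos(theta) / math.sin(theta)
    dd_theta = -gamma_theta_phiphi * d_phi * d_phi
    dd_phi = -2.0 * gamma_phi_thetaphi * d_theta * d_phi
    return (d_theta, d_phi, dd_theta, dd_phi)

def rk4_step(rhs, state, h):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(base, slope, factor):
        return tuple(b + factor * s for b, s in zip(base, slope))
    k1 = rhs(state)
    k2 = rhs(shifted(state, k1, 0.5 * h))
    k3 = rhs(shifted(state, k2, 0.5 * h))
    k4 = rhs(shifted(state, k3, h))
    return tuple(x + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                 for x, a, b, c, d in zip(state, k1, k2, k3, k4))

def integrate_geodesic(state, s_end, steps):
    """Integrate from s = 0 to s = s_end and return the list of visited states."""
    h = s_end / steps
    path = [state]
    for _ in range(steps):
        state = rk4_step(geodesic_rhs, state, h)
        path.append(state)
    return path

def embed(theta, phi):
    """Cartesian position on the unit sphere."""
    return (math.sin(theta) * math.cos(phi),
            math.sin(theta) * math.sin(phi),
            math.cos(theta))

def speed_squared(state):
    """g_ij y'^i y'^j for the unit sphere."""
    theta, _, d_theta, d_phi = state
    return d_theta ** 2 + (math.sin(theta) * d_phi) ** 2
```

Starting on the equator with the unit velocity $(\dot\theta, \dot\varphi) = (0.6, 0.8)$, the computed curve should keep $g_{ij}\dot{y}^i\dot{y}^j = 1$ and remain in the plane through the centre with normal (0, 0.6, 0.8), that is, on a great circle.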


II. THE TRANSPORT OF EUCLIDEAN METRICS


77. Mapping a curve of $V_n$ on Euclidean space. Consider any curve
C of $V_n$ which is described parametrically. The coordinates $y^i$ of a
point M of C are functions of some parameter t. Denote a certain
point on C by A; for example, that point which corresponds to the
value t = 0.

Each point M of C is made to correspond to a point m with the
natural frame $(m, \mathbf{e}_i)$ in the Euclidean space $\mathcal{E}_n$ in the following way:

(1) To the point A there corresponds an arbitrarily chosen point
a with the frame $[a, (\mathbf{e}_i)_0]$ whose orientation is not determined. The
magnitudes and relative inclinations of the $(\mathbf{e}_i)_0$ are, however, well
defined by the relations

$$ (\mathbf{e}_i)_0 \cdot (\mathbf{e}_j)_0 = (g_{ij})_0, \qquad (77.1) $$

where the $(g_{ij})_0$ are the coefficients of the Riemannian metric at the
point A (t = 0).

(2) The point m and the vectors $\mathbf{e}_i$ satisfy the equations

$$ \frac{d\mathbf{m}}{dt} = \frac{dy^i}{dt}\,\mathbf{e}_i, \qquad \frac{d\mathbf{e}_i}{dt} = (\Gamma_i{}^h{}_k)_M\,\frac{dy^k}{dt}\,\mathbf{e}_h, \qquad (77.2) $$

where the $(\Gamma_i{}^h{}_k)_M$ are the values of the Christoffel symbols at the
point M and are consequently functions of the single parameter t.

The integration of the system of differential equations (77.2) with
the initial conditions (1) defines a point m and a frame $(m, \mathbf{e}_i)$ for each
value of t and similarly for every point M of C. The path $\gamma$ in $\mathcal{E}_n$
followed by m when M describes C is called the mapping (or
development) of C on the Euclidean space. If the initial conditions (1) are
varied the initial frame $[a, (\mathbf{e}_i)_0]$ is changed and so is $\gamma$.

As an example of such a mapping imagine a curve C to be traced on 
a surface S and apply a developable surface to S along C. If this 
surface is now laid on a plane it defines a plane curve $\gamma$ which can be
thought of as a flat map of C.
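
The development of a curve can also be computed directly from (77.2). As an illustrative assumption, the sketch below takes the circle of colatitude $\theta = \theta_0$ on the unit sphere, with $\varphi$ as the parameter t, and integrates the frame equations in the plane with a Runge-Kutta step. One expects the developed curve to be an arc of a circle of radius $\tan\theta_0$ (the slant height of the cone tangent to the sphere along the latitude), and the scalar products of the frame vectors to stay equal to the metric coefficients along C. All names are hypothetical.

```python
import math

def develop_latitude(theta0, steps=4000):
    """Integrate (77.2) along theta = theta0, phi = t on the unit sphere, 0 <= t <= 2*pi.

    Each returned state is (m_x, m_y, e_theta_x, e_theta_y, e_phi_x, e_phi_y).
    """
    cot = math.cos(theta0) / math.sin(theta0)   # Christoffel symbol with upper index phi
    sc = -math.sin(theta0) * math.cos(theta0)   # Christoffel symbol with upper index theta

    def rhs(s):
        _, _, etx, ety, epx, epy = s
        return (epx, epy,               # dm/dt = e_phi, since only dphi/dt = 1 is non-zero
                cot * epx, cot * epy,   # d e_theta / dt
                sc * etx, sc * ety)     # d e_phi / dt

    def shifted(base, slope, factor):
        return tuple(b + factor * k for b, k in zip(base, slope))

    h = 2.0 * math.pi / steps
    # Initial frame chosen so that e_i . e_j = g_ij at the starting point, as in (77.1).
    state = (0.0, 0.0, 1.0, 0.0, 0.0, math.sin(theta0))
    path = [state]
    for _ in range(steps):
        k1 = rhs(state)
        k2 = rhs(shifted(state, k1, 0.5 * h))
        k3 = rhs(shifted(state, k2, 0.5 * h))
        k4 = rhs(shifted(state, k3, h))
        state = tuple(x + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                      for x, a, b, c, d in zip(state, k1, k2, k3, k4))
        path.append(state)
    return path
```

With the initial frame used here the centre of the developed circle lies at $(-\tan\theta_0, 0)$; a different initial orientation would simply rotate the picture, in line with the remark that the orientation of the initial frame is not determined.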


78. The transport of Euclidean metrics along a curve. We wish to
establish the following fundamental theorem concerning the mapping
$\gamma$ of C:

It is possible to find a metric in $\mathcal{E}_n$ such that the numerical values of its
coefficients and their first derivatives along $\gamma$ coincide with the numerical
values of the coefficients of the Riemannian metric and their first
derivatives at the corresponding points of C. In other words, it is
possible to construct a Euclidean metric which osculates with the
Riemannian metric simultaneously for all points of C.

In order to simplify the notation we make a coordinate
transformation such that C is defined by the equations

$$ y^1 = y^2 = \ldots = y^{n-1} = 0, $$

and take $y^n$ to be the parameter t. It will be convenient to use Greek
indices to represent the range of values (1, 2, ..., n-1) and Latin
indices to represent (1, 2, ..., n-1, n). With these conventions the
equations (77.2) which determine the mapping $\gamma$ of C take the form

$$ \frac{d\mathbf{m}}{dy^n} = \mathbf{e}_n, \qquad \frac{d\mathbf{e}_i}{dy^n} = (\Gamma_i{}^h{}_n)_{y^\alpha = 0}\,\mathbf{e}_h. \qquad (78.1) $$


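Suppose that a correspondence is set up between each point P of
$V_n$ in the vicinity of M and a point p of $\mathcal{E}_n$ in the vicinity of m in the
following way: if P has the coordinates $y^i$ and m is the point of $\gamma$
whose parameter is $y^n$, then the point p is defined by

$$ \overrightarrow{mp} = y^\lambda\,\mathbf{e}_\lambda + \left[ \tfrac{1}{2}\,(\Gamma_\lambda{}^i{}_\mu)_{y^\alpha = 0}\,y^\lambda y^\mu + \Psi^i_{(3)} \right] \mathbf{e}_i, \qquad (78.2) $$

where the $\Psi^i_{(3)}$ are of third order in the $y^\lambda$.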


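Equation (78.2) defines the point p of $\mathcal{E}_n$ as a function of the n scalar
variables $(y^i)$. The $(y^i)$ therefore constitute a system of curvilinear
coordinates for $\mathcal{E}_n$ in the vicinity of $\gamma$. The natural frame at m
$(y^\lambda = 0)$ for this system of curvilinear coordinates is, from (78.1)
and (78.2), defined by the vectors

$$ \left( \frac{\partial \mathbf{p}}{\partial y^\lambda} \right)_{y^\alpha = 0} = \mathbf{e}_\lambda, \qquad \left( \frac{\partial \mathbf{p}}{\partial y^n} \right)_{y^\alpha = 0} = \frac{d\mathbf{m}}{dy^n} = \mathbf{e}_n, $$

which is to say that it coincides with the frame $(m, \mathbf{e}_i)$ obtained in the
mapping. The metric of $\mathcal{E}_n$ in the coordinate system $(y^i)$ has the scalar
products $\mathbf{e}_i \cdot \mathbf{e}_j$ as coefficients at m. Using (77.2) or (78.1) the
quantities $\mathbf{e}_i \cdot \mathbf{e}_j$ satisfy the equations

$$ d(\mathbf{e}_i \cdot \mathbf{e}_j) = \left[ (\Gamma_i{}^h{}_k)_M\,\mathbf{e}_h \cdot \mathbf{e}_j + (\Gamma_j{}^h{}_k)_M\,\mathbf{e}_i \cdot \mathbf{e}_h \right] dy^k \qquad (78.3) $$

as M describes C.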
On the other hand, according to the definition of the Christoffel
symbols, the Riemannian metric coefficients $g_{ij}$ satisfy

$$ dg_{ij} = \left( [ik, j] + [jk, i] \right) dy^k, $$

therefore

$$ dg_{ij} = \left[ (\Gamma_i{}^h{}_k)_M\,g_{hj} + (\Gamma_j{}^h{}_k)_M\,g_{ih} \right] dy^k. \qquad (78.4) $$

The quantities $\mathbf{e}_i \cdot \mathbf{e}_j$ and $g_{ij}$ thus satisfy the same differential system
as the point M describes C. Moreover, from (77.1), the initial
conditions at A are the same. It follows that

$$ \mathbf{e}_i \cdot \mathbf{e}_j \equiv g_{ij} $$

identically as M describes C. The Euclidean metric and the
Riemannian metric are therefore tangential at all points of C.
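To demonstrate that they are osculating it is now sufficient to
establish that the quantities $(\Gamma_i{}^h{}_j)_{y^\alpha = 0}$ are the values of the
Christoffel symbols on $\gamma$ for the Euclidean metric. These symbols are the
coefficients of $\mathbf{e}_h$ in $\partial^2 \mathbf{p} / (\partial y^i\,\partial y^j)$. Now, using (78.2),

$$ \left( \frac{\partial^2 \mathbf{p}}{\partial y^\lambda\,\partial y^\mu} \right)_{y^\alpha = 0} = (\Gamma_\lambda{}^h{}_\mu)_{y^\alpha = 0}\,\mathbf{e}_h $$

and from (78.1),

$$ \left( \frac{\partial^2 \mathbf{p}}{\partial y^n\,\partial y^i} \right)_{y^\alpha = 0} = \frac{d}{dy^n} \left( \frac{\partial \mathbf{p}}{\partial y^i} \right)_{y^\alpha = 0} = \frac{d\mathbf{e}_i}{dy^n} = (\Gamma_i{}^h{}_n)_{y^\alpha = 0}\,\mathbf{e}_h, $$

which completes the demonstration of our theorem. The Euclidean
metric obtained will be called the Euclidean metric transported along
C.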


79. Geometrical applications. A number of geometrical results can 
be obtained from the concepts of mapping and the transport of 
Euclidean metrics. We limit ourselves to a few of these. 

Let C be a geodesic in the Riemannian space $V_n$. Each point M of C
is associated with the unit vector u tangential to C and, when we pass
from the point M to a neighbouring point of C, the absolute
differential of u is zero. Now this absolute differential is the same as that of
the vector image in $\mathcal{E}_n$, assuming a representation of second order
in the vicinity of M (for example, that which defines the transport
of a Euclidean metric). The image vector is therefore simply the
unit tangent vector to the curve $\gamma$ which is the mapping of C. It
follows that the curve $\gamma$ is a straight line in $\mathcal{E}_n$. We may therefore state
the THEOREM:

The geodesics of a Riemannian space are those curves which map on 
Euclidean space as straight lines. 

Using this theorem it is easy to deduce, in a purely geometric fashion, 
that the geodesics of Riemannian space are the curves which define 
the extreme length of arc between two fixed points, but we only 
mention this in passing. 

Consider now any curve C of $V_n$ and suppose that a vector v(M) is
associated with each point M of C and is continuously variable as M
describes C. The absolute differential of v on passing from the point
M to a neighbouring point is equal to the difference between the
image vectors in $\mathcal{E}_n$ for the representation associated with a Euclidean
metric transported along C. We are therefore led to measure the
finite geometrical variation of the vector v(M), on passing from a
point A to a point B along the curve, as the difference between the
image vectors of v(A) and v(B).

Suppose a vector field v(M) to be given in $V_n$. The finite geometric
variation of a vector of the field in passing between two points A and
B is, according to the above definition, dependent on the path C in $V_n$
taken between A and B.


III. CURVATURE TENSOR OF A
RIEMANNIAN SPACE


80. Mapping a quasi-parallelogram. In our study of Euclidean space 
using curvilinear coordinates it was pointed out that, in order that a 
differential quadratic form might define a Euclidean metric, it is 
necessary for the conditions of integrability of the $d\mathbf{e}_i$ to be satisfied.
In Riemannian geometry these conditions do not, in general, hold.
We can, however, interpret the $d\mathbf{e}_i$ geometrically using a method due
to Cartan. 

In order to evaluate the second derivatives of a scalar or vector
function of the variables $(y^i)$ it is necessary to carry out two
successive differentiations which correspond to distinct variations of the $y^i$.
We generalize this procedure by introducing two distinct
differentiation symbols.

Starting with any values $y^i$ of the variables we introduce the
arbitrary variations $dy^i$. Let these variations define a differentiation
symbol d. In a similar fashion let $\delta y^i$ be a second set of arbitrary
variations defining another differentiation symbol $\delta$. Thus, starting
from the values $(y^i + dy^i)$ of the variables and effecting the $\delta$
differentiation, we obtain the values

$$ (y^i + dy^i + \delta y^i + \delta dy^i). \qquad (80.1) $$

If, starting with the values $(y^i + \delta y^i)$, we perform the differentiation
d, we likewise obtain the values

$$ (y^i + \delta y^i + dy^i + d\delta y^i). \qquad (80.2) $$

These two sets of values are identical if we arrange that

$$ d\delta y^i = \delta dy^i. \qquad (80.3) $$
We then say that the two differentiation symbols commute for scalar
functions or, more briefly, commute. If $f(y^i)$ denotes a twice
continuously differentiable function of the $y^i$ then

$$ \delta f = \partial_i f\,\delta y^i $$

and

$$ d\delta f = \partial_{ij} f\,dy^j\,\delta y^i + \partial_i f\,d\delta y^i. $$

Using the symmetry of the second derivatives and (80.3) we have

$$ d\delta f = \delta df. \qquad (80.4) $$

We note, in particular, that the commutation property of the two
differentiation symbols holds for an arbitrary change of variables $y^i$.

Consider two interchangeable differentiation symbols and, starting
from the point M of $V_n$ with coordinates $y^i$, carry out the
differentiation d which changes $M(y^i)$ to $M + dM\,(y^i + dy^i)$ and the
differentiation $\delta$ which changes M to $M + \delta M\,(y^i + \delta y^i)$. We suppose that
dM and $\delta M$ are not collinear, which is equivalent to saying that $dy^i$
and $\delta y^i$ are not proportional. Starting from M + dM perform the
differentiation $\delta$ which carries it to the point M' with coordinates (80.1).
Because of the interchangeability of d and $\delta$, we obtain the same point
M' as if we had performed the differentiation d starting from
$M + \delta M$, since the coordinates (80.2) of the point reached coincide
with (80.1). The closed path defined by the four points
$(M, M + dM, M', M + \delta M)$ will be called a quasi-parallelogram.

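We now propose to map the alternative paths in the
quasi-parallelogram between M and M' on the Euclidean space $\mathcal{E}_n$. The point M
corresponds to a frame $(m, \mathbf{e}_i)$ in $\mathcal{E}_n$. If we carry out first the differentiation
d and then the differentiation $\delta$, we map the sides (M, M + dM) and
(M + dM, M') of the quasi-parallelogram. The frames $(m, \mathbf{e}_i)$ and
$(m + dm, \mathbf{e}_i + d\mathbf{e}_i)$ are related by (77.2). Hence

$$ d\mathbf{m} = dy^i\,\mathbf{e}_i, \qquad d\mathbf{e}_i = \omega^h{}_i(d)\,\mathbf{e}_h, \qquad (80.5) $$

where $\omega^h{}_i(d)$ denotes the differential form $\omega^h{}_i$ evaluated at $(y^i + dy^i)$.
Mapping the side (M + dM, M') then carries us from the frame
$(m + dm, \mathbf{e}_i + d\mathbf{e}_i)$ to the frame $[m'_1, (\mathbf{e}'_i)_1]$ with

$$ \overrightarrow{m m'_1} = d\mathbf{m} + \delta\mathbf{m} + \delta d\mathbf{m}, $$

$$ (\mathbf{e}'_i)_1 - \mathbf{e}_i = d\mathbf{e}_i + \delta\mathbf{e}_i + \delta d\mathbf{e}_i. $$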
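Reversing the order of differentiation we pass first from the frame
$(m, \mathbf{e}_i)$ to the frame $(m + \delta m, \mathbf{e}_i + \delta\mathbf{e}_i)$ with

$$ \delta\mathbf{m} = \delta y^i\,\mathbf{e}_i, \qquad \delta\mathbf{e}_i = \omega^h{}_i(\delta)\,\mathbf{e}_h, \qquad (80.5') $$

and then from this frame to $[m'_2, (\mathbf{e}'_i)_2]$ with

$$ \overrightarrow{m m'_2} = \delta\mathbf{m} + d\mathbf{m} + d\delta\mathbf{m}, $$

$$ (\mathbf{e}'_i)_2 - \mathbf{e}_i = \delta\mathbf{e}_i + d\mathbf{e}_i + d\delta\mathbf{e}_i. $$

It follows that $[m'_1, (\mathbf{e}'_i)_1]$ and $[m'_2, (\mathbf{e}'_i)_2]$ are related by the formulae

$$ \overrightarrow{m'_1 m'_2} = d\delta\mathbf{m} - \delta d\mathbf{m}, \qquad (80.6) $$

$$ (\mathbf{e}'_i)_2 - (\mathbf{e}'_i)_1 = d\delta\mathbf{e}_i - \delta d\mathbf{e}_i, \qquad (80.7) $$

where we wish to evaluate the right-hand sides. We have

$$ d\delta\mathbf{m} - \delta d\mathbf{m} = d(\delta y^i\,\mathbf{e}_i) - \delta(dy^i\,\mathbf{e}_i) = \delta y^i\,d\mathbf{e}_i - dy^i\,\delta\mathbf{e}_i, $$

and from (80.5) and (80.5')

$$ d\delta\mathbf{m} - \delta d\mathbf{m} = \left( \Gamma_i{}^h{}_k\,dy^k\,\delta y^i - \Gamma_i{}^h{}_k\,dy^i\,\delta y^k \right) \mathbf{e}_h. $$

Interchanging the indices i and k in the second term on the right-hand
side and using the symmetry of the Christoffel symbols in their lower
indices we find that

$$ d\delta\mathbf{m} - \delta d\mathbf{m} = \left( \Gamma_i{}^h{}_k - \Gamma_k{}^h{}_i \right) dy^k\,\delta y^i\,\mathbf{e}_h = 0. \qquad (80.8) $$

The two mappings thus lead to the same origin m' for the final
frame. This is essentially due to the fact that the Christoffel symbols
are determined in such a way that the integrability conditions are
satisfied.

Now compare the vectors of the two frames with origin at m'. We
have

$$ d\delta\mathbf{e}_i - \delta d\mathbf{e}_i = d[\omega^h{}_i(\delta)\,\mathbf{e}_h] - \delta[\omega^h{}_i(d)\,\mathbf{e}_h] = [d\omega^h{}_i(\delta) - \delta\omega^h{}_i(d)]\,\mathbf{e}_h + \omega^k{}_i(\delta)\,d\mathbf{e}_k - \omega^k{}_i(d)\,\delta\mathbf{e}_k $$

and using (80.5) and (80.5')

$$ d\delta\mathbf{e}_i - \delta d\mathbf{e}_i = \left[ d\omega^h{}_i(\delta) - \delta\omega^h{}_i(d) + \omega^k{}_i(\delta)\,\omega^h{}_k(d) - \omega^k{}_i(d)\,\omega^h{}_k(\delta) \right] \mathbf{e}_h. $$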
We write

$$ d\delta\mathbf{e}_i - \delta d\mathbf{e}_i = \Omega^h{}_i\,\mathbf{e}_h \qquad (80.9) $$

with

$$ \Omega^h{}_i = d\omega^h{}_i(\delta) - \delta\omega^h{}_i(d) + \omega^k{}_i(\delta)\,\omega^h{}_k(d) - \omega^k{}_i(d)\,\omega^h{}_k(\delta). \qquad (80.10) $$

For an arbitrary Riemannian metric, the quantities $\Omega^h{}_i$ are, in
general, different from zero and the frames associated with the two
mappings have different orientations; however, they have the same
form and magnitude since the scalar products of the vectors of these
frames are given by the coefficients of the metric at M'. The $\Omega^h{}_i$ thus
define the rotation at m' that is required to pass from one frame to the
other.

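It is important to realize that the $\Omega^h{}_i$ are the components of a
tensor. A change of curvilinear coordinates corresponds to a change
of frame defined by

$$ \mathbf{e}_i = A_i^{j'}\,\mathbf{e}_{j'}, $$

from which we deduce that

$$ \delta\mathbf{e}_i = A_i^{j'}\,\delta\mathbf{e}_{j'} + \delta A_i^{j'}\,\mathbf{e}_{j'} $$

and

$$ d\delta\mathbf{e}_i = A_i^{j'}\,d\delta\mathbf{e}_{j'} + dA_i^{j'}\,\delta\mathbf{e}_{j'} + \delta A_i^{j'}\,d\mathbf{e}_{j'} + d\delta A_i^{j'}\,\mathbf{e}_{j'}. $$

On interchanging the two differentiation symbols and taking the
difference between corresponding terms, it follows, since d and $\delta$ are
interchangeable when acting upon the $A_i^{j'}$,

$$ d\delta\mathbf{e}_i - \delta d\mathbf{e}_i = A_i^{j'} \left( d\delta\mathbf{e}_{j'} - \delta d\mathbf{e}_{j'} \right), $$

and on introducing the $\Omega^h{}_i$ as in (80.9)

$$ \Omega^h{}_i\,\mathbf{e}_h = A_i^{j'}\,\Omega^{k'}{}_{j'}\,\mathbf{e}_{k'} = A_i^{j'} A_{k'}^{h}\,\Omega^{k'}{}_{j'}\,\mathbf{e}_h. $$

Identifying the coefficients of $\mathbf{e}_h$ on each side we obtain the tensor
transformation

$$ \Omega^h{}_i = A_i^{j'} A_{k'}^{h}\,\Omega^{k'}{}_{j'}. $$

We therefore state the following THEOREM:
The quantities $\Omega^h{}_i$ are the components of a mixed tensor.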
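
81. The Riemann-Christoffel tensor. It can be shown that the
quantities $\Omega^h{}_i$ are bilinear forms with respect to the $dy^r$ and $\delta y^s$. The
coefficients of these forms can be evaluated in terms of the Christoffel
symbols. We have

$$ d\omega^h{}_i(\delta) = d(\Gamma_i{}^h{}_s\,\delta y^s) = \partial_r \Gamma_i{}^h{}_s\,dy^r\,\delta y^s + \Gamma_i{}^h{}_s\,d\delta y^s, $$

hence

$$ d\omega^h{}_i(\delta) - \delta\omega^h{}_i(d) = \partial_r \Gamma_i{}^h{}_s\,dy^r\,\delta y^s - \partial_r \Gamma_i{}^h{}_s\,\delta y^r\,dy^s, $$

and on interchanging the indices r and s in the second term on the
right-hand side

$$ d\omega^h{}_i(\delta) - \delta\omega^h{}_i(d) = \left( \partial_r \Gamma_i{}^h{}_s - \partial_s \Gamma_i{}^h{}_r \right) dy^r\,\delta y^s. $$

Furthermore

$$ \omega^k{}_i(\delta)\,\omega^h{}_k(d) - \omega^k{}_i(d)\,\omega^h{}_k(\delta) = \left( \Gamma_k{}^h{}_r\,\Gamma_i{}^k{}_s - \Gamma_k{}^h{}_s\,\Gamma_i{}^k{}_r \right) dy^r\,\delta y^s. $$

We conclude from (80.10) that the $\Omega^h{}_i$ can be expressed as the bilinear
form

$$ \Omega^h{}_i = R_i{}^h{}_{rs}\,dy^r\,\delta y^s, \qquad (81.1) $$

where

$$ R_i{}^h{}_{rs} = \partial_r \Gamma_i{}^h{}_s - \partial_s \Gamma_i{}^h{}_r + \Gamma_k{}^h{}_r\,\Gamma_i{}^k{}_s - \Gamma_k{}^h{}_s\,\Gamma_i{}^k{}_r. \qquad (81.2) $$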


Since $dy^r$ and $\delta y^s$ are the contravariant components of two
arbitrary vectors and the $\Omega^h{}_i$ are components of a tensor, it follows that
the quantities (81.2) define a tensor field in $V_n$. The tensor $R_i{}^h{}_{rs}$,
which is obviously antisymmetric with respect to the indices r and s,
is called the Riemann-Christoffel tensor or curvature tensor of the
Riemannian space $V_n$. The curvature of a Riemannian space $V_n$ is
thus characterized by the fact that if two distinct paths with the same
extremities are mapped on Euclidean space starting with the same
initial frame, then the final frames have a different orientation.
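
The change of orientation can be made visible by carrying a vector around a closed curve with zero absolute differential, that is, by solving $dv^i + \Gamma_h{}^i{}_k\,v^h\,dy^k = 0$ along the curve. The sketch below assumes the unit sphere and the circle of colatitude $\theta_0$, traversed once; the choice of surface, the Runge-Kutta integrator and the function names are illustrative assumptions.

```python
import math

def transport_around_latitude(theta0, v_theta, v_phi, steps=4000):
    """Carry (v^theta, v^phi) once around theta = theta0 with zero absolute differential."""
    cot = math.cos(theta0) / math.sin(theta0)   # Christoffel symbol with upper index phi
    sc = -math.sin(theta0) * math.cos(theta0)   # Christoffel symbol with upper index theta

    def rhs(v):
        vt, vp = v
        # dv^i/dt = -Gamma_h^i_k v^h dy^k/dt, with only dphi/dt = 1 non-zero
        return (-sc * vp, -cot * vt)

    def shifted(base, slope, factor):
        return tuple(b + factor * k for b, k in zip(base, slope))

    h = 2.0 * math.pi / steps
    v = (v_theta, v_phi)
    for _ in range(steps):
        k1 = rhs(v)
        k2 = rhs(shifted(v, k1, 0.5 * h))
        k3 = rhs(shifted(v, k2, 0.5 * h))
        k4 = rhs(shifted(v, k3, h))
        v = tuple(x + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                  for x, a, b, c, d in zip(v, k1, k2, k3, k4))
    return v

def orthonormal_components(theta0, v):
    """Components along unit vectors in the theta and phi directions."""
    return (v[0], math.sin(theta0) * v[1])
```

For a vector initially along the $\theta$ direction, the orthonormal components after one circuit should be $(\cos(2\pi\cos\theta_0), -\sin(2\pi\cos\theta_0))$. At $\theta_0 = \pi/3$ the vector returns reversed, whereas on the equator, which is a geodesic, it returns unchanged.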

It follows from (80.9) and (81.1) that the conditions for the
integrability of vectors may be expressed by the equations

$$ R_i{}^h{}_{rs} = 0. \qquad (81.3) $$
Given an arbitrary quadratic differential form, the conditions
(81.3) have to be satisfied for it to be a metric in Euclidean space. It
can be shown that, when the corresponding manifold is topologically
equivalent to Euclidean space, these conditions are also sufficient to
make $V_n$ Euclidean; when this is not so, the Riemannian space for
which the conditions (81.3) are satisfied is said to be locally Euclidean.†
Such a space does not differ from a Euclidean space in its local
properties.

† For example, the surface of a cylinder set in $\mathcal{E}_3$ is locally Euclidean, but
it is not topologically equivalent (i.e. homeomorphic) to a plane. (T.)


82. Covariant components of the Riemann-Christoffel tensor. In order
to carry out the calculation of the covariant components of the
curvature tensor in a simple manner we adopt the following purely
symbolic convention: an index which the covariant differential
operator $\nabla_r$ does not act upon is said to be mute, and is written in
parentheses. With this notation we can write

$$ \nabla_r \Gamma_{(i)}{}^h{}_{(s)} = \partial_r \Gamma_i{}^h{}_s + \Gamma_l{}^h{}_r\,\Gamma_i{}^l{}_s, $$

and consequently

$$ R_i{}^h{}_{rs} = \nabla_r \Gamma_{(i)}{}^h{}_{(s)} - \nabla_s \Gamma_{(i)}{}^h{}_{(r)}. $$

It follows, using Ricci's theorem, that

$$ R_{ij,rs} = g_{jh}\,R_i{}^h{}_{rs} = \nabla_r [g_{jh}\,\Gamma_{(i)}{}^h{}_{(s)}] - \nabla_s [g_{jh}\,\Gamma_{(i)}{}^h{}_{(r)}], $$

or

$$ R_{ij,rs} = \nabla_r \Gamma_{(i)j(s)} - \nabla_s \Gamma_{(i)j(r)}. $$

Substituting the explicit form of the terms on the right-hand side

$$ R_{ij,rs} = \partial_r \Gamma_{ijs} - \partial_s \Gamma_{ijr} - \Gamma_r{}^l{}_j\,\Gamma_{ils} + \Gamma_s{}^l{}_j\,\Gamma_{ilr}. \qquad (82.1) $$

The second derivatives of the $g_{ij}$ only occur in the first two terms of
the right-hand side of (82.1). Let us write these terms explicitly:

$$ \partial_r \Gamma_{ijs} = \partial_r [is, j] = \tfrac{1}{2}\,\partial_r \left( \partial_s g_{ij} + \partial_i g_{js} - \partial_j g_{is} \right), $$

$$ \partial_s \Gamma_{ijr} = \partial_s [ir, j] = \tfrac{1}{2}\,\partial_s \left( \partial_r g_{ij} + \partial_i g_{jr} - \partial_j g_{ir} \right), $$

which give by subtraction,

$$ R_{ij,rs} = \tfrac{1}{2} \left( \partial_{ir} g_{js} + \partial_{js} g_{ir} - \partial_{jr} g_{is} - \partial_{is} g_{jr} \right) - g^{lm} \left( \Gamma_{rmj}\,\Gamma_{ils} - \Gamma_{smj}\,\Gamma_{ilr} \right). \qquad (82.2) $$

Certain symmetry relations can be seen from equations (82.2). These
appear even more clearly in a conveniently chosen coordinate system.

83. Normal coordinates. Relations between the components of the
curvature tensor. Given a point M of $V_n$, carry out a change of
variables from $(y^i)$ to $(z^i)$ which are such that, at M, the Christoffel
symbols vanish and the $g_{ij}$ remain unaltered. Then, in a second-order
representation, the Euclidean space $\mathcal{E}_n$ will be referred to a system of
rectilinear coordinates associated with the natural frame $(m, \mathbf{e}_i)$ at M.
Since the natural frames at M for the $(y^i)$ and the $(z^i)$ are the same, the
components of a given tensor at M are the same in both systems of
coordinates. The coordinate system $(z^i)$ is called the normal coordinate
system at M associated with the coordinates $(y^i)$. The use of such
normal coordinate systems is often valuable in avoiding lengthy
calculations in Riemannian geometry. We shall use them to establish
certain relations between the components of the Riemann-Christoffel
tensor.

The covariant components at M of the curvature tensor (which
have the same values in the two coordinate systems) are given in
terms of the normal coordinates at M by the relations (82.2) with the
Christoffel symbols omitted:

$$ R_{ij,rs} = \tfrac{1}{2} \left( \partial_{ir} g_{js} + \partial_{js} g_{ir} - \partial_{jr} g_{is} - \partial_{is} g_{jr} \right), \qquad (83.1) $$

the quantities on the right-hand side being evaluated with respect to
the normal coordinates. We deduce that, for any coordinate system,†

$$ R_{ij,rs} = R_{rs,ij}, \qquad (83.2) $$

$$ R_{ij,rs} = -R_{ij,sr} = -R_{ji,rs}. \qquad (83.3) $$

† These equations certainly hold in a normal coordinate system and,
being tensor equations, they must also hold in any other coordinate
system. (T.)

Also, permuting the indices j, r, s, we obtain

$$ R_{ir,sj} = \tfrac{1}{2} \left( \partial_{is} g_{rj} + \partial_{rj} g_{is} - \partial_{rs} g_{ij} - \partial_{ij} g_{rs} \right), $$

$$ R_{is,jr} = \tfrac{1}{2} \left( \partial_{ij} g_{sr} + \partial_{sr} g_{ij} - \partial_{sj} g_{ir} - \partial_{ir} g_{sj} \right), $$

and adding these results

$$ R_{ij,rs} + R_{ir,sj} + R_{is,jr} = 0. \qquad (83.4) $$

Equations (83.3) and (83.4) form a complete set of identities for the
components of the Riemann-Christoffel tensor: each identity
connecting the components $R_{ij,rs}$ is satisfied by a set of numbers $R_{ij,rs}$
restricted only by the conditions (83.3), (83.4). Every other identity is
an algebraic consequence of (83.3) and (83.4); this is true in particular
of (83.2).


84. Second-order covariant derivatives of a vector. Given a vector
field in terms of the contravariant components $v^h$ we wish to evaluate
the difference between $\nabla_r(\nabla_s v^h)$ and $\nabla_s(\nabla_r v^h)$. In Euclidean space this
difference is zero but in Riemannian geometry it depends upon the
curvature of the space. Consider a system of normal coordinates at
an arbitrary point M of $V_n$. By definition, at any point of $V_n$

$$ \nabla_s v^h = \partial_s v^h + \Gamma_i{}^h{}_s\,v^i. $$

Since the coordinates are normal at M, the Christoffel symbols vanish
there and we have

$$ \nabla_r(\nabla_s v^h) = \partial_{rs} v^h + \partial_r \Gamma_i{}^h{}_s\,v^i. $$

It follows that

$$ \nabla_r(\nabla_s v^h) - \nabla_s(\nabla_r v^h) = \left( \partial_r \Gamma_i{}^h{}_s - \partial_s \Gamma_i{}^h{}_r \right) v^i. $$

However, in a normal coordinate system we have, from (81.2),

$$ R_i{}^h{}_{rs} = \partial_r \Gamma_i{}^h{}_s - \partial_s \Gamma_i{}^h{}_r. $$

Therefore

$$ \nabla_r(\nabla_s v^h) - \nabla_s(\nabla_r v^h) = R_i{}^h{}_{rs}\,v^i. \qquad (84.1) $$

Since the terms of this identity are tensors, (84.1) is valid for any
system of coordinates and for any point of $V_n$. If it had been
established directly, the identity (84.1) might have been used to introduce
the Riemann-Christoffel tensor. This tensor was, in fact, originally
introduced in this way.

85. The Ricci tensor. Let us investigate the tensors which can be
formed from the Riemann-Christoffel tensor by contraction. Since
the components $R_{ij,rs}$ are antisymmetric on the one hand with respect
to i and j and on the other with respect to r and s, contraction of i with
j, or of r with s, merely results in a tensor which is identically zero.

Consider, however, the contraction of an index from the first group
with one from the second, using, for example, the second and third
indices. In this way we obtain the tensor

$$ R_{ij} = R_i{}^k{}_{kj}. \qquad (85.1) $$

This tensor is evidently symmetric with respect to the indices i and j,
since

$$ R_{ij} = g^{kl} R_{ik,lj} = g^{kl} R_{lj,ik} = g^{kl} R_{jl,ki} = R_{ji}. $$

If we contract either index of the first group with either one of the
second we always obtain either the tensor $R_{ij}$ or its negative since,
from (83.3)

$$ g^{kl} R_{ki,jl} = g^{kl} R_{ik,lj} = R_{ij} $$

and

$$ g^{kl} R_{ki,lj} = g^{kl} R_{ik,jl} = -g^{kl} R_{ik,lj} = -R_{ij}. $$

The symmetric tensor $R_{ij}$, which plays a fundamental role in relativistic
gravitational theory, is known as the Ricci tensor. Using (81.2) and
(85.1) we can deduce the following expression for this tensor:

$$ R_{ij} = \partial_k \Gamma_i{}^k{}_j - \partial_j \Gamma_i{}^k{}_k + \Gamma_l{}^k{}_k\,\Gamma_i{}^l{}_j - \Gamma_l{}^k{}_j\,\Gamma_i{}^l{}_k. \qquad (85.2) $$

Contracting the Ricci tensor we obtain the invariant

$$ R = R_i{}^i = g^{ij} R_{ij} \qquad (85.3) $$

which is called the scalar Riemannian curvature of the space $V_n$.

In the Riemannian space realized by an ordinary two-dimensional 
locally deformed surface, the scalar Riemannian curvature is equiva- 
lent to what is known, in differential geometry, as the total curvature 
of the surface. This total curvature is known to depend only upon $ds^2$,
the line element in the surface.
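
The chain from the metric to the scalar curvature can be followed numerically. The sketch below is an illustration under stated assumptions: it takes the metric of a sphere of radius a = 2 in coordinates $(\theta, \varphi)$, forms the Christoffel symbols by central differences of the $g_{ij}$, differentiates them once more to obtain the curvature tensor in the form (81.2), and contracts as in (85.1) and (85.3). The helper names, the step sizes and the restriction of `inverse_2x2` to two dimensions are choices made here. With these sign conventions one should find $R = 2/a^2$, twice the Gaussian curvature $1/a^2$.

```python
import math

def sphere_metric(y, a=2.0):
    """g_ij of a sphere of radius a in coordinates y = (theta, phi)."""
    return [[a * a, 0.0], [0.0, (a * math.sin(y[0])) ** 2]]

def inverse_2x2(g):
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    return [[g[1][1] / det, -g[0][1] / det], [-g[1][0] / det, g[0][0] / det]]

def shifted(y, k, h):
    """Copy of y with the k-th coordinate displaced by h."""
    z = list(y)
    z[k] += h
    return z

def christoffel(metric, y, h=1e-4):
    """gam[i][u][k] = Gamma_i^u_k = g^{uj} [ik, j], derivatives by central differences."""
    n = len(y)
    g_inv = inverse_2x2(metric(y))
    dg = []  # dg[k][i][j] = d_k g_ij
    for k in range(n):
        gp, gm = metric(shifted(y, k, h)), metric(shifted(y, k, -h))
        dg.append([[(gp[i][j] - gm[i][j]) / (2 * h) for j in range(n)]
                   for i in range(n)])
    return [[[sum(g_inv[u][j] * 0.5 * (dg[i][k][j] + dg[k][i][j] - dg[j][i][k])
                  for j in range(n))
              for k in range(n)] for u in range(n)] for i in range(n)]

def riemann(metric, y, h=1e-4):
    """R[i][u][r][s] = R_i^u_rs, assembled term by term as in (81.2)."""
    n = len(y)
    gam = christoffel(metric, y)
    dgam = []  # dgam[r][i][u][k] = d_r Gamma_i^u_k
    for r in range(n):
        gp = christoffel(metric, shifted(y, r, h))
        gm = christoffel(metric, shifted(y, r, -h))
        dgam.append([[[(gp[i][u][k] - gm[i][u][k]) / (2 * h) for k in range(n)]
                      for u in range(n)] for i in range(n)])
    return [[[[dgam[r][i][u][s] - dgam[s][i][u][r]
               + sum(gam[k][u][r] * gam[i][k][s] - gam[k][u][s] * gam[i][k][r]
                     for k in range(n))
               for s in range(n)] for r in range(n)] for u in range(n)]
            for i in range(n)]

def scalar_curvature(metric, y):
    """R = g^{ij} R_ij with R_ij = R_i^k_kj, as in (85.1) and (85.3)."""
    n = len(y)
    curv = riemann(metric, y)
    g_inv = inverse_2x2(metric(y))
    ricci = [[sum(curv[i][k][k][j] for k in range(n)) for j in range(n)]
             for i in range(n)]
    return sum(g_inv[i][j] * ricci[i][j] for i in range(n) for j in range(n))
```

Evaluating `scalar_curvature(sphere_metric, [1.0, 0.3])` should give a value close to 0.5, independent of the point chosen away from the poles.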


86. Bianchi identities. There exist identities other than those implied by
(83.3) and (83.4), between the components $\nabla_t R_i{}^h{}_{rs}$ of the tensor
derivative of the curvature tensor.

In order to establish these important identities we again use a
system of normal coordinates at an arbitrary point M of $V_n$. The
Christoffel symbols are then zero at M and using (81.2) we deduce by
differentiation that, at the point M,

$$ \nabla_t R_i{}^h{}_{rs} = \partial_{tr} \Gamma_i{}^h{}_s - \partial_{ts} \Gamma_i{}^h{}_r. \qquad (86.1) $$

A cyclic permutation of the indices r, s and t gives

$$ \nabla_r R_i{}^h{}_{st} = \partial_{rs} \Gamma_i{}^h{}_t - \partial_{rt} \Gamma_i{}^h{}_s, $$

$$ \nabla_s R_i{}^h{}_{tr} = \partial_{st} \Gamma_i{}^h{}_r - \partial_{sr} \Gamma_i{}^h{}_t. $$

Adding these expressions we obtain

$$ \nabla_r R_i{}^h{}_{st} + \nabla_s R_i{}^h{}_{tr} + \nabla_t R_i{}^h{}_{rs} = 0. \qquad (86.2) $$

From their tensorial form these identities are obviously valid in any
coordinate system and for any point of $V_n$. They are known as the
Bianchi identities.

Double contraction of (86.2) leads to an important consequence
relating to the Ricci tensor. Putting t = h we obtain

$$ -\nabla_r R_{is} + \nabla_s R_{ir} + \nabla_h R_i{}^h{}_{rs} = 0, $$

then, contracting i and s,

$$ -\nabla_r R + \nabla_s R^s{}_r + \nabla_h R^h{}_r = 0, $$

or

$$ 2\,\nabla_s R^s{}_r - \nabla_r R = 0. $$

Using Ricci's theorem this may be rewritten in the form

$$ \nabla_s \left( R^s{}_r - \tfrac{1}{2}\,g^s{}_r\,R \right) = 0. \qquad (86.3) $$

It follows that the symmetric tensor

$$ S_{ij} = R_{ij} - \tfrac{1}{2}\,g_{ij}\,R \qquad (86.4) $$

satisfies the identities

$$ \nabla_s S^s{}_r = 0. \qquad (86.5) $$

These identities are fundamental to relativistic gravitation theory in
which they are used to introduce conservation principles.


PART II: APPLICATIONS 


CHAPTER VI 
Tensor Calculus and Classical Dynamics 


I. DYNAMICS OF HOLONOMIC SYSTEMS WITH 
TIME-INDEPENDENT CONSTRAINTS 


87. Configuration space as a Riemannian space. Consider a dynamical
system S with time independent holonomic constraints and n degrees
of freedom. The set of configurations of such a system constitutes an
n-dimensional differentiable continuum which is called configuration
space. We describe S by the parameters $(q^1, \dots, q^n)$ which form a
coordinate system in the configuration space. According to our
definition of S, its kinetic energy T is a positive definite quadratic
form in the time derivatives of the $q^i$:

$$2T = a_{ij}\,q'^i q'^j \qquad \left(q'^i = \frac{dq^i}{dt}\right), \qquad (87.1)$$

where the $a_{ij}$ are functions of the parameters $q^i$. We can associate the
dynamical system S with the proper Riemannian space $V_n$ defined by
the configuration space associated with the metric

$$ds^2 = 2T\,dt^2,$$

which may be rewritten

$$ds^2 = a_{ij}\,dq^i dq^j. \qquad (87.2)$$


A definite point M of the configuration space corresponds to each
configuration of the system in such a way that a displacement of the
point M in the Riemannian space $V_n$ is associated with every
displacement of the system. We propose to represent the motion of the
system S by that of a point in the Riemannian space.
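
As an illustration that is not taken from the text, consider a single particle of mass m moving in a plane and described by polar coordinates, so that n = 2 and $(q^1, q^2) = (r, \theta)$. The symbols m, r and $\theta$ are assumptions of this example.

```latex
\begin{align*}
2T &= m\left(r'^2 + r^2\,\theta'^2\right),
   \qquad a_{11} = m,\quad a_{22} = m r^2,\quad a_{12} = 0,\\
ds^2 &= 2T\,dt^2 = m\left(dr^2 + r^2\,d\theta^2\right).
\end{align*}
```

Up to the constant factor m this is the Euclidean metric of the plane in polar coordinates, so in this simple case the representative point M just follows the particle.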

88. Kinematics of the point M. Let us first complete the kinematic
considerations sketched in Chapter V. The velocity vector, v, of the
point M whose coordinates are $q^i$, has the contravariant components

$$v^i = \frac{dq^i}{dt} = q'^i.$$

The covariant components of this vector are consequently given by

$$v_i = a_{ij}\,q'^j = \frac{\partial T}{\partial q'^i}. \qquad (88.1)$$

We thus see that the momenta $p_i$, which continually appear in
analytical dynamics, are just the covariant components of the velocity
vector of the representative point M in the Riemannian space $V_n$.
Let us denote the unit vector tangential to the trajectory, C, of M by
u; this has the components

$$u^i = \frac{dq^i}{ds}.$$

Hence

$$v^i = \frac{ds}{dt}\,u^i = v\,u^i,$$

and the magnitude of the velocity vector is given by

$$v^2 = \left(\frac{ds}{dt}\right)^2 = 2T. \qquad (88.2)$$

Now consider the acceleration vector. Since u is a unit vector its
absolute differential, as M describes C, is perpendicular to it.
Consequently we may put

$$\frac{\nabla u^i}{ds} = \frac{n^i}{\rho}, \qquad (88.3)$$

where the $n^i$ denote the components of a unit vector normal to u and $\rho$
is a positive scalar quantity. The vector n will be called the principal
normal vector to C and $\rho^{-1}$ the curvature of C in $V_n$. (88.3) is a
generalization of Frenet’s first formula in elementary differential
geometry. Using

$$v^i = v\,u^i,$$

and differentiating with respect to time, we have

$$\gamma^i = \frac{\nabla v^i}{dt} = \frac{dv}{dt}\,u^i + v\,\frac{ds}{dt}\,\frac{\nabla u^i}{ds},$$

or

$$\gamma^i = \frac{dv}{dt}\,u^i + \frac{v^2}{\rho}\,n^i. \qquad (88.4)$$

Hence the acceleration vector $\gamma$ decomposes into a tangential and a
normal acceleration given by the same expressions as in classical
mechanics.


89. Equations of motion. Let

$$Q_i\,dq^i$$

denote the infinitesimal amount of work done by the given applied
forces acting on S in a small arbitrary virtual displacement. This
expression is invariant under a change of parameters if the $Q_i$ are the
covariant components of a vector in $V_n$. The vector $Q_i$ is called the
generalized force vector. The motion of S is determined by the
Lagrange equations†

$$P_i \equiv \frac{d}{dt}\left(\frac{\partial T}{\partial q'^i}\right) - \frac{\partial T}{\partial q^i} = Q_i. \qquad (89.1)$$

Let us try to interpret the components of $P_i$. We have

$$\frac{\partial T}{\partial q'^i} = a_{ij}\,q'^j, \qquad \frac{\partial T}{\partial q^i} = \tfrac12\,\partial_i a_{jk}\,q'^j q'^k.$$

Hence the $P_i$ have the explicit form

$$P_i = \frac{d}{dt}\left(a_{ij}\,q'^j\right) - \tfrac12\,\partial_i a_{jk}\,q'^j q'^k. \qquad (89.2)$$

† These equations are discussed in any elementary account of analytical
mechanics. They are derived from Newton’s laws of motion which relate
the forces to the acceleration of the individual particles of the system. (T.)

However, from a calculation identical with that given in §76, we see
that the right-hand side of (89.2) is equal to

$$a_{ij}\,\frac{dq'^j}{dt} + [jk, i]\,q'^j q'^k,$$

where the Christoffel symbols are defined in terms of the $a_{ij}$. We
deduce that

$$P_i = a_{ij}\,\frac{\nabla v^j}{dt} = \frac{\nabla v_i}{dt} = \gamma_i. \qquad (89.3)$$

Thus the quantities $P_i$, which appear on the left-hand side of the
Lagrange equations, are just the covariant components of the
acceleration vector of M, and the Lagrange equations may be
rewritten as

$$\gamma_i = Q_i. \qquad (89.4)$$

The equations of motion of M are thus obtained by equating the
acceleration vector in $V_n$ to the generalized force vector; that is by
writing down for M the precise generalization of the equation of
motion in particle dynamics and considering M to have unit mass.

Using (88.4) we may write the equations of motion as

$$\frac{dv}{dt}\,u_i + \frac{v^2}{\rho}\,n_i = Q_i. \qquad (89.5)$$

It follows that the generalized force vector is always coplanar with
the tangent and the principal normal of the trajectory of M.

If the $Q_i$ are zero, the point M has zero acceleration in $V_n$, which
implies that

$$\frac{dv}{dt} = 0, \qquad \frac{1}{\rho} = 0.$$

The motion of S in the absence of forces is therefore associated with
the uniform motion of M along geodesics in $V_n$.
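
Raising the index in (89.4) gives the equations of motion in a form that can be compared directly with the geodesic equations. The sketch below assumes the usual Christoffel symbols of the second kind, written here as $\Gamma_j{}^i{}_k = a^{il}[jk, l]$ with $a^{il}$ the inverse of $a_{ij}$; this notation is an assumption of the illustration.

```latex
\begin{align*}
\gamma^i = a^{il}\gamma_l
  &= \frac{d^2 q^i}{dt^2} + \Gamma_j{}^i{}_k\,\frac{dq^j}{dt}\frac{dq^k}{dt} = Q^i,
  \qquad Q^i = a^{il}Q_l,\\
\intertext{and for $Q^i = 0$, where $v = ds/dt$ is constant, the same equations in terms of the arc length $s$ read}
 0 &= \frac{d^2 q^i}{ds^2} + \Gamma_j{}^i{}_k\,\frac{dq^j}{ds}\frac{dq^k}{ds}.
\end{align*}
```

The second line is the geodesic equation of $V_n$, which is the statement made in the text for force-free motion.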


90. Energy integral. Suppose that the system S can be associated with
a time independent potential function U of the parameters $q^i$ which
satisfies

$$Q_i = -\partial_i U.$$

Then taking the scalar product of each side of (89.5) with the vector v
which has the components $v^i = v\,u^i$ we find, allowing for the
orthogonality of v and n,

$$v\,\frac{dv}{dt} = -\partial_i U\,\frac{dq^i}{dt} = -\frac{dU}{dt}.$$

It follows that the equations of motion of M have a first integral
which is a generalization of the energy integral relating to a single
particle in elementary dynamics:

$$\tfrac12\,v^2 + U = h \qquad (h = \text{const.}). \qquad (90.1)$$

According to (88.2) this is also the energy integral of the physical
system S.


91. Principle of Maupertuis. We retain the assumption which led to
an energy integral in the previous section and take a particular value
for the arbitrary constant appearing in the function U. It is then easy
to derive a generalization of the principle of Maupertuis regarding
motion in the absence of forces and to give a geometrical
interpretation of this.

Consider a parametric representation of the motion of M in which
the coordinates $q^i$ and the time t are expressed as functions of some
parameter $\tau$. We suppose that the relation between $\tau$ and t is known,
and write

$$\frac{d\tau}{dt} = F. \qquad (91.1)$$

We have

$$q'^i = \frac{dq^i}{dt} = F\,\frac{dq^i}{d\tau},$$

and the quantities $P_i$ may be written

$$P_i = F\left[\frac{d}{d\tau}\left(F a_{ij}\,\frac{dq^j}{d\tau}\right) - \frac{F}{2}\,\partial_i a_{jk}\,\frac{dq^j}{d\tau}\frac{dq^k}{d\tau}\right].$$
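Hence we deduce that the trajectory, C, of M can be represented in
terms of the parameter $\tau$ as the solution of the system of differential
equations

$$\frac{d}{d\tau}\left(F a_{ij}\,\frac{dq^j}{d\tau}\right) - \frac{F}{2}\,\partial_i a_{jk}\,\frac{dq^j}{d\tau}\frac{dq^k}{d\tau} = \frac{Q_i}{F}. \qquad (91.2)$$

The form of the left-hand side of (91.2) brings to mind the equations
which determine the extremals of

$$\sigma = \int \sqrt{F\,a_{ij}\,dq^i dq^j}, \qquad (91.3)$$

where F is a function of the $q^i$. Let us determine the extremals of (91.3)
assuming F to be a known function. These correspond to the geodesics
of the Riemannian metric

$$d\sigma^2 = F\,ds^2 = F\,a_{ij}\,dq^i dq^j. \qquad (91.4)$$

We take $\sigma$ itself as parameter on these geodesics and, in order to
simplify the notation, we put

$$\dot q^i = \frac{dq^i}{d\sigma}.$$

It follows from (76.7) that the differential equations which determine
the geodesics can be written

$$\frac{d}{d\sigma}\left(F a_{ij}\,\dot q^j\right) - \tfrac12\,\partial_i\left(F a_{jk}\right)\dot q^j \dot q^k = 0,$$

so that

$$\frac{d}{d\sigma}\left(F a_{ij}\,\dot q^j\right) - \frac{F}{2}\,\partial_i a_{jk}\,\dot q^j \dot q^k = \tfrac12\left(\partial_i F\right) a_{jk}\,\dot q^j \dot q^k. \qquad (91.5)$$

From (91.4)

$$F\,a_{jk}\,\dot q^j \dot q^k = 1,$$

so that (91.5) may be rewritten

$$\frac{d}{d\sigma}\left(F a_{ij}\,\dot q^j\right) - \frac{F}{2}\,\partial_i a_{jk}\,\dot q^j \dot q^k = \frac12\,\frac{\partial_i F}{F}. \qquad (91.6)$$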
We can identify (91.2) with (91.6) if we take

$$F = -2U + \text{const.},$$

so that

$$Q_i = \tfrac12\,\partial_i F = -\partial_i U.$$

Among all the trajectories which are solutions of (91.2) consider
those which correspond to a given value of the energy constant h.
(88.2) substituted into (90.1) gives

$$2T = 2(h - U). \qquad (91.7)$$

The parameter $\tau$ is completely determined on these trajectories
(apart from an additive constant) if we put

$$F = 2(h - U). \qquad (91.8)$$

Using (91.4), the parameter $\sigma$ is determined by

$$\frac{d\sigma}{dt} = \sqrt{F \cdot 2T}$$

or, from (91.7),

$$\frac{d\sigma}{dt} = \frac{d\tau}{dt}.$$

The parameters $\sigma$ and $\tau$ are therefore identical and the trajectories
under consideration satisfy the differential system (91.6), that is to say
they are the geodesics of the Riemannian metric

$$d\sigma^2 = 2(h - U)\,a_{ij}\,dq^i dq^j.$$

We now express this formally:

MAUPERTUIS’ PRINCIPLE: Given the form of the energy integral,
the trajectories of a dynamical system which are determined by a
particular value, h, of the energy constant are the geodesics of the
configuration space with the Riemannian metric

$$d\sigma^2 = 2(h - U)\,ds^2 = 2(h - U)\,a_{ij}\,dq^i dq^j. \qquad (91.9)$$

The motion along these geodesics is given by

$$\frac{d\sigma}{dt} = 2(h - U). \qquad (91.10)$$
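
For instance (an illustration that is not part of the text), take a particle of unit mass in a plane attracted towards the origin with potential $U = -k/r$. The constant k > 0 and the polar coordinates $(r, \theta)$ are assumptions of this example.

```latex
\begin{align*}
ds^2 &= dr^2 + r^2\,d\theta^2, \qquad U = -\frac{k}{r},\\
d\sigma^2 &= 2\left(h + \frac{k}{r}\right)\left(dr^2 + r^2\,d\theta^2\right),
\qquad \frac{d\sigma}{dt} = 2\left(h + \frac{k}{r}\right).
\end{align*}
```

The orbits of energy h are then the geodesics of this conformally flat metric in the region where $h + k/r > 0$.
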
92. Some applications. The introduction of Riemannian spaces
associated with the configuration spaces gives us a geometrical picture 
for all problems concerning the dynamics of holonomic systems with 
time independent constraints. The techniques of tensor calculus may 
therefore be applied to such problems. The use of tensor methods in 
analytical dynamics has resolved a certain number of problems for 
which solutions could not be obtained using only the methods of 
analytical dynamics. This is so, in particular, in the case of the problem 
of the transformation of the equations of motion which was posed by 
Painlevé and Levi-Civita and which can be formulated as follows: 
Under what conditions do two systems S and S’, which share the
same configuration space, have the same trajectories independently
of the way in which these trajectories are described in time?†

The stability of the motion of a dynamical system S is best
discussed by reference to the configuration space $V_n$ with the metric
$ds^2 = 2T\,dt^2$ using a method analogous to Levi-Civita’s method of
geodesic spread.‡

We mention finally the research on reducible dynamical systems§:
a dynamical system is said to be reducible if, for a certain choice of
parameters $q^i$, its energy T can be expressed as the sum of the energies
of two systems having respectively r and (n − r) degrees of freedom

$$2T_1 = \sum_{\alpha,\beta=1}^{r} a_{\alpha\beta}(q^1, \dots, q^r)\,q'^\alpha q'^\beta,$$

$$2T_2 = \sum_{\lambda,\mu=r+1}^{n} b_{\lambda\mu}(q^{r+1}, \dots, q^n)\,q'^\lambda q'^\mu,$$

the potential function U also being the sum of two potential functions

$$U = U_1(q^1, \dots, q^r) + U_2(q^{r+1}, \dots, q^n).$$


† PAINLEVÉ, ‘Sur la transformation des équations de la dynamique’,
Journ. de Math. (1894); LEVI-CIVITA, ‘Sulle trasformazioni delle equazioni
dinamiche’, Ann. di Mat. (1896); THOMAS, T. Y., ‘On the transformation of
the equations of dynamics’, Journ. of Math. Phys. (1946); LICHNEROWICZ,
‘Sur la transformation des équations de la dynamique’, C. R. Acad. Sci.
(1946).

‡ LEVI-CIVITA, ‘Sur l’écart géodésique’, Math. Ann. (1926); SYNGE, ‘On
the geometry of dynamics’, Trans. Roy. Soc., Lond. (1926).

§ STÄCKEL, C. R. Acad. Sci. (1895); THOMAS, T. Y., ‘Reducible dynamical
systems’, Journ. of Math. Phys. (1947).
Given a dynamical system S referred to an arbitrary set of parameters, it is
of interest to determine under what conditions it is possible to find a
set of parameters $(q^i)$ which allow the reduction of the system.

The explicit solution of these problems goes beyond the scope of
this book and we refer the interested reader to the original papers. It
is worth noting that, to some extent, the considerations of previous
sections can be extended to non-holonomic systems with time
independent constraints.†


II. DYNAMICS OF HOLONOMIC SYSTEMS WITH
TIME-DEPENDENT CONSTRAINTS 


93. Configuration space-time. Consider a dynamical system having
n degrees of freedom and with time-dependent holonomic constraints.
Since the constraints are time dependent the possible configurations
of the system depend upon the instant of time under consideration.
We therefore substitute a configuration space-time for configuration
space, that is a continuum $V_{n+1}$ with (n + 1) dimensions which has the
parameters $q^i$ (i = 1, ..., n) of the system and the time $t = q^0$ as
coordinates. This coordinate system is restricted by allowing only
coordinate transformations of the type


q =f'(¢°,q) κ᾽ = q°+ d(q') 


(i, etc. = 1, ..., n).
It will be convenient to write

$$q'^\alpha = \frac{dq^\alpha}{dt} \qquad (\alpha, \text{etc.} = 0, 1, \dots, n)$$

so that

$$q'^0 = 1, \qquad q'^i = \frac{dq^i}{dt}.$$

In this notation the energy of Σ is given by

$$2T = a_{\alpha\beta}\,q'^\alpha q'^\beta, \qquad (93.1)$$

† See, for example, the work by Synge just quoted.
where the $a_{\alpha\beta}$ are functions of the $q^\alpha$. We associate the dynamical
system Σ with the Riemannian space $V_{n+1}$ defined by configuration
space-time with the metric

$$ds^2 = 2T\,dt^2 = a_{\alpha\beta}\,dq^\alpha dq^\beta. \qquad (93.2)$$

Each configuration of the system at time t corresponds to a
completely determined point M of configuration space-time. Each
possible displacement of the system is associated with the
displacement of the point M in the Riemannian space $V_{n+1}$, this displacement
being defined by giving the n coordinates $q^i$ as functions of the
(n+1)-th coordinate $q^0 = t$.
The velocity vector of M has the contravariant components
$v^\alpha = q'^\alpha$ and its covariant components are given by

$$v_\alpha = a_{\alpha\beta}\,q'^\beta = \frac{\partial T}{\partial q'^\alpha}.$$

The n components $v_i$ are thus equal to the momentum components, $p_i$,
of the system Σ. The acceleration vector $\gamma$ has the covariant
components

$$\gamma_\alpha = \frac{\nabla v_\alpha}{dt} = a_{\alpha\beta}\,q''^\beta + [\beta\gamma, \alpha]\,q'^\beta q'^\gamma. \qquad (93.3)$$

We note that $q''^0 = 0$. In particular it follows from (93.3) that

$$\gamma_0 = a_{0i}\,q''^i + [ik, 0]\,q'^i q'^k + \partial_i a_{00}\,q'^i + \tfrac12\,\partial_0 a_{00}. \qquad (93.4)$$


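94. Equations of motion. Once again let

$$Q_i\,dq^i$$

be the infinitesimal amount of work done in a small arbitrary virtual
displacement by the forces applied to Σ. The motion of Σ is
determined by the Lagrange equations

$$P_i \equiv \frac{d}{dt}\left(\frac{\partial T}{\partial q'^i}\right) - \frac{\partial T}{\partial q^i} = Q_i. \qquad (94.1)$$

Multiplying these by $q'^i$ and summing

$$\frac{d}{dt}\left(q'^i\,\frac{\partial T}{\partial q'^i}\right) - q''^i\,\frac{\partial T}{\partial q'^i} - q'^i\,\frac{\partial T}{\partial q^i} = Q_i\,q'^i.$$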
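Using Euler’s theorem for homogeneous functions, we have

$$q'^i\,\frac{\partial T}{\partial q'^i} + q'^0\,\frac{\partial T}{\partial q'^0} = 2T,$$

$$q''^i\,\frac{\partial T}{\partial q'^i} + q'^i\,\frac{\partial T}{\partial q^i} = \frac{dT}{dt} - \frac{\partial T}{\partial q^0},$$

so that

$$\frac{dT}{dt} - \left[\frac{d}{dt}\left(\frac{\partial T}{\partial q'^0}\right) - \frac{\partial T}{\partial q^0}\right] = Q_i\,q'^i. \qquad (94.2)$$

We now wish to interpret the quantities $P_\alpha$ which appear in (94.1)
and (94.2). As in §89 we have

$$P_\alpha = \frac{d}{dt}\left(\frac{\partial T}{\partial q'^\alpha}\right) - \frac{\partial T}{\partial q^\alpha} = a_{\alpha\beta}\,q''^\beta + [\beta\gamma, \alpha]\,q'^\beta q'^\gamma.$$

Putting

$$P_\alpha = \gamma_\alpha,$$

the Lagrange equations (94.1) become

$$\gamma_i = Q_i \qquad (94.3)$$

and (94.2) takes the form

$$\frac{dT}{dt} - \gamma_0 = Q_i\,q'^i. \qquad (94.4)$$

Equations (94.3) and (94.4) are the equations of motion of M in
$V_{n+1}$. If the motion of S takes place in the absence of applied forces the
n components $\gamma_i$ of the acceleration of S are zero, but $\gamma_0$ is, in general,
different from zero and the corresponding trajectories of M in $V_{n+1}$
cannot be interpreted in a simple geometrical manner.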


95. Systems with a potential function. Suppose that the forces acting
on Σ are derivable from a potential function $U(q^0, q^1, \dots, q^n)$ which is
explicitly time dependent. We know that if we introduce the
Lagrangian of the system

$$L = T - U,$$

then the equations of motion of Σ can be written

$$\frac{d}{dt}\left(\frac{\partial L}{\partial q'^i}\right) - \frac{\partial L}{\partial q^i} = 0.$$

Instead of $ds^2 = 2T\,dt^2$, it is now convenient to introduce the metric

$$d\sigma^2 = 2L\,dt^2. \qquad (95.1)$$

Absorbing the function U in the coefficient $a_{00}$ we write

$$2L = a_{\alpha\beta}\,q'^\alpha q'^\beta,$$

so that

$$d\sigma^2 = a_{\alpha\beta}\,dq^\alpha dq^\beta.$$

All the formulae established in §93 and §94 remain valid in terms
of the metric (95.1) if we replace T by L and write $Q_i = 0$. We thus have

$$\gamma_\alpha = \frac{d}{dt}\left(\frac{\partial L}{\partial q'^\alpha}\right) - \frac{\partial L}{\partial q^\alpha}$$

and the equations of motion of the representative point M in the
space $V_{n+1}$ with the metric (95.1) can be written

$$\gamma_i = 0, \qquad \gamma_0 = \frac{dL}{dt}. \qquad (95.2)$$

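
As a check on these formulae, consider (as an assumed illustration, not taken from the text) a single particle of unit mass on a line, with coordinate $q = q^1$ and potential $U(t, q)$, so that n = 1. The components $\gamma_1$ and $\gamma_0$ are evaluated from (93.3) and (93.4) with T replaced by L.

```latex
\begin{align*}
2L &= q'^2 - 2U(t,q), \qquad a_{11} = 1,\quad a_{10} = 0,\quad a_{00} = -2U,\\
\gamma_1 &= q'' + [00,1] = q'' - \tfrac12\,\partial_1 a_{00} = q'' + \partial_1 U,\\
\gamma_0 &= \partial_1 a_{00}\,q' + \tfrac12\,\partial_0 a_{00} = -2\,\partial_1 U\,q' - \partial_0 U,\\
\frac{dL}{dt} &= q'q'' - \partial_1 U\,q' - \partial_0 U .
\end{align*}
```

Here $\gamma_1 = 0$ is Newton’s equation $q'' = -\partial_1 U$; substituting it into $dL/dt$ gives $-2\,\partial_1 U\,q' - \partial_0 U$, which is $\gamma_0$, so that both members of the equations of motion are recovered.
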
96. Eisenhart’s theorem. Eisenhart has given a simple geometrical
interpretation of (95.2) by introducing a Riemannian metric with
(n+2) dimensions. If $u = q^{n+1}$ denotes a supplementary parameter,
consider the improper Riemannian metric defined by

$$d\tau^2 = 2L\,dt^2 + 2\,dt\,du, \qquad (96.1)$$

giving

$$d\tau^2 = a_{ij}\,dq^i dq^j + 2a_{i0}\,dq^i dq^0 + a_{00}\,(dq^0)^2 + 2\,dq^0 dq^{n+1}. \qquad (96.2)$$

Among the geodesics of such a metric there are some which are
real and of zero length, that is with $d\tau = 0$. For simplicity we shall
consider only these geodesics in what follows. If we write the metric
(96.2) in the condensed form

$$d\tau^2 = a_{AB}\,dq^A dq^B \qquad (A, B, \text{etc.} = 0, 1, \dots, n+1)$$
with

$$a_{0\,n+1} = 1, \qquad a_{i\,n+1} = a_{n+1\,n+1} = 0,$$

then the system of differential equations which determine the
geodesics of (96.2) may be written

$$a_{AB}\,\frac{d^2 q^B}{d\tau^2} + [BC, A]\,\frac{dq^B}{d\tau}\frac{dq^C}{d\tau} = 0. \qquad (96.3)$$

When A = n + 1, we have simply

$$\frac{d^2 t}{d\tau^2} = 0, \qquad \frac{dt}{d\tau} = a \qquad (a = \text{const.}). \qquad (96.4)$$

Using (96.1) and (96.4) it follows that

$$\frac{du}{dt} = \frac12\left(\frac{d\tau}{dt}\right)^2 - L = \frac{1}{2a^2} - L \qquad (96.5)$$

and, by integration,

$$u = \frac{t}{2a^2} - \int_0^t L\,dt + b \qquad (b = \text{const.}). \qquad (96.6)$$

The differential system (96.3) may then be written

$$a_{AB}\,\frac{d^2 q^B}{dt^2} + [BC, A]\,\frac{dq^B}{dt}\frac{dq^C}{dt} = 0.$$

When A = i we get

$$\gamma_i = 0$$

and, for A = 0,

$$\frac{d^2 u}{dt^2} + \gamma_0 = 0$$

or, using (96.5),

$$\gamma_0 = \frac{dL}{dt},$$

and we recover (95.2) which are the equations of motion of the
system Σ. A consideration of the initial conditions enables us to
deduce the following theorem†:

† EISENHART, ‘Dynamical trajectories and geodesics’, Ann. of Math.

EISENHART’S THEOREM: All possible motions of a holonomic
system Σ which admits a potential function can be obtained in the
following way: consider the metric (96.1); the possible motions of Σ
correspond to the geodesics of this metric which satisfy the initial
conditions

$$u_0 = b, \qquad \left(\frac{du}{dt}\right)_0 = \frac{1}{2a^2} - (L)_0,$$

where a and b are arbitrary constants.

The trajectories of the representative point M of $V_{n+1}$ are given in
configuration space-time by the projections of the geodesics of 
(96.1) along the coordinate lines of the parameter u. The relation 
(96.6) shows the connection between the variable u and the action of 
the system. We remark finally that, given Eisenhart’s theorem, the 
Hamilton-Jacobi theorem of analytical mechanics can be interpreted 
in a particularly simple manner. 
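
To illustrate the construction under assumptions that are not in the text, take a free particle of unit mass on a line, so that n = 1, $2L = q'^2$ and U = 0. The constants $q_0$ and w (initial position and velocity) are introduced only for this sketch.

```latex
\begin{align*}
d\tau^2 &= dq^2 + 2\,dt\,du,\\
q'' &= 0 \;\Rightarrow\; q = q_0 + w\,t, \qquad L = \tfrac12\,w^2,\\
u &= \frac{t}{2a^2} - \int_0^t L\,dt + b
   = \left(\frac{1}{2a^2} - \frac{w^2}{2}\right)t + b .
\end{align*}
```

Since the coefficients of this metric are constants, its geodesics are straight lines in the variables (q, t, u), and the curve above is indeed linear in t. Its projection along the coordinate lines of u is the uniform motion $q = q_0 + wt$.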


III. DYNAMICS OF CONTINUOUS MEDIA


97. Continuous media. In a later part of this chapter we propose to 
indicate some applications of tensor calculus to the dynamics of 
continuous media. This includes the hydrodynamics of fluids as well 
as the study of the deformation of solids (theory of elasticity). 
Historically, it was the work of the physicist Voigt on the deforma- 
tion of crystalline media that gave rise to the idea of tensors and it is 
to the theory of elasticity that they owe their name. This, of all the 
classical theories, is probably the one where tensor methods have 
proved most useful, thanks to the easy introduction of those curvi- 
linear coordinates which are most appropriate to the physical 
problems under consideration. Lastly, the general equations for 
continuous media which we shall establish are very important 
because of Einstein’s use of them in general relativity theory. 

From the microscopic point of view all physical media consist of 
particles. It is possible, however, to adopt a macroscopic viewpoint 
and to describe the behaviour of a continuous medium such as a 


† See BRILLOUIN, L., Les tenseurs en mécanique et en élasticité (Masson,
1938), on this point.
fluid or elastic solid by considering a small volume of the medium
rather than the individual particles. In the course of time, particles 
enter and leave such a volume element, each particle obeying the 
usual laws of mechanics. It is necessary to reformulate these laws in 
such a way that the positions and velocities of the individual particles 
do not appear. The volume elements considered must therefore con- 
tain a sufficiently large number of particles in order that the mean 
values of these quantities should be well defined. If the space is
referred to three arbitrary curvilinear coordinates $y^1, y^2, y^3$ we assume
that it is possible to define a density of matter $\rho$ and a vector velocity
v at the point $M(y^i)$ and at the instant t. The variables $(y^1, y^2, y^3, t)$ are
known as the Euler variables.


98. Total and partial time derivatives. Let us consider a function of
position, q, in a continuous medium (for example, a scalar such as $\rho$,
or a component of a vector such as v, etc.) and suppose that we wish
to describe its evolution in time. This can be done in two ways:
either we describe its evolution in time at a fixed point $M(y^i)$ so that
the derivative of q with respect to time is the partial derivative given by

$$\frac{\partial q}{\partial t} = \left(\frac{\partial q}{\partial t}\right)_{y^i = \text{const.}},$$

or we can refer this evolution to a coordinate system which is tied
locally to the average motion of the matter. In the latter case the time
derivative of q is the total derivative

$$\frac{dq}{dt} = \frac{\partial q}{\partial t} + \frac{\partial q}{\partial y^1}\frac{dy^1}{dt} + \frac{\partial q}{\partial y^2}\frac{dy^2}{dt} + \frac{\partial q}{\partial y^3}\frac{dy^3}{dt}. \qquad (98.1)$$

Introducing the contravariant components of v defined by

$$v^i = \frac{dy^i}{dt},$$

and using the usual notation for partial derivatives, this relation
becomes

$$\frac{dq}{dt} = \frac{\partial q}{\partial t} + v^i\,\partial_i q. \qquad (98.2)$$

If q is a scalar (98.2) can be written in vectorial form as

$$\frac{dq}{dt} = \frac{\partial q}{\partial t} + \mathbf{v}\cdot\operatorname{grad} q. \qquad (98.3)$$
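
A simple case, given here as an assumed illustration, is a steady scalar field: q does not depend explicitly on t, yet it changes for an observer moving with the matter.

```latex
\begin{align*}
\frac{\partial q}{\partial t} = 0
\quad\Longrightarrow\quad
\frac{dq}{dt} = v^i\,\partial_i q = \mathbf{v}\cdot\operatorname{grad} q .
\end{align*}
```

The total derivative then vanishes only where the velocity is tangent to the surfaces q = const.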


99. The continuity equation. The two purely kinetic quantities, $\rho$ and
v, are not independent since the change of density in a volume element
is determined by the flux of matter which crosses the surface of this
element. This is expressed mathematically by the well-known
continuity equation

$$\frac{\partial\rho}{\partial t} + \operatorname{div}(\rho\mathbf{v}) = 0, \qquad (99.1)$$

which may be expressed in tensor form as

$$\frac{\partial\rho}{\partial t} + \nabla_i(\rho v^i) = 0,$$

where the covariant differentiation is performed in a Euclidean space
of three dimensions referred to the curvilinear coordinates $y^i$.
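
In practice the covariant divergence can be evaluated with the standard identity below, where g denotes the determinant of the metric tensor $g_{ij}$ of the coordinates $y^i$. The spherical coordinates $(r, \theta, \varphi)$ used in the second formula, for which $\sqrt{g} = r^2\sin\theta$, are an assumed example and not part of the text.

```latex
\begin{align*}
\nabla_i(\rho v^i) &= \frac{1}{\sqrt{g}}\,\partial_i\!\left(\sqrt{g}\,\rho v^i\right),\\
0 &= \frac{\partial\rho}{\partial t}
 + \frac{1}{r^2}\,\partial_r\!\left(r^2\rho v^r\right)
 + \frac{1}{\sin\theta}\,\partial_\theta\!\left(\sin\theta\,\rho v^\theta\right)
 + \partial_\varphi\!\left(\rho v^\varphi\right).
\end{align*}
```

Here $v^r, v^\theta, v^\varphi$ are the contravariant components $dy^i/dt$ of the velocity, not its physical components.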


100. Body and surface forces. In order to deal with the dynamics of 
continuous media it is necessary to investigate the forces which are 
exerted on the volume elements of the medium. Draw a closed surface 
S inside the medium and consider the external forces acting on the 
volume, V, of the medium enclosed by S. These forces fall naturally 
into two classes: 

(1) Body forces: these act directly on the different volume elements
of V. Choose an element dV of V with mass $dm = \rho\,dV$. The forces
acting on this volume element are of the order of dm and hence dV;
we denote their resultant by

$$\mathbf{f}\,dV,$$

where f, in general a function of position, is the body force per unit
volume.

(2) Surface forces: these are the forces acting on the surface which
arise from the action between the elements of the medium contiguous
to S on the outside and on the inside. If dS is a surface element then
it is acted upon by a force of order dS which we denote by

$$-\mathbf{T}\,dS.$$

This force depends only upon the position of the element dS in the
medium and not upon the rest of the surface which contains it. It is 
generally inclined at an angle to the normal of the surface element dS; 
when directed towards the interior of S it is called a compression and 
in the reverse case, a tension. 


101. The stress tensor. We now propose to investigate the variation
of the force TdS with the different possible orientations of dS.

To this end we refer the continuous medium to a frame $(O, x^1, x^2, x^3)$
formed by three rectangular Cartesian coordinate axes. At any point
M of the medium consider three lines parallel to the axes and take
three infinitesimal elements along these. In this way we obtain an
elementary tetrahedron MABC whose faces MBC, MCA, MAB,
ABC will be denoted respectively by $dS^{23}$, $dS^{31}$, $dS^{12}$ and dS. If
$\alpha_1, \alpha_2, \alpha_3$ denote the components of the unit vector along the normal
to dS exterior to the tetrahedron we have

$$dS^{23} = \alpha_1\,dS, \qquad dS^{31} = \alpha_2\,dS, \qquad dS^{12} = \alpha_3\,dS.$$

The surface force acting on ABC differs only by higher order terms
from that which would be exerted on a similar parallel surface centred
at M. Consequently we denote by TdS the surface force exerted over
dS by the interior elements of the tetrahedron. The equality of action
and reaction implies that the force exerted over dS by the external
elements will be $-\mathbf{T}\,dS$.

The tetrahedron is therefore subjected to the body force fdV and
to the surface forces which act on its four faces and may be represented
by

$$\boldsymbol{\Theta}_{23}\,dS^{23}, \qquad \boldsymbol{\Theta}_{31}\,dS^{31}, \qquad \boldsymbol{\Theta}_{12}\,dS^{12}, \qquad -\mathbf{T}\,dS.$$

If we add to these the inertial force $-\boldsymbol{\gamma}\,dm$ corresponding to the
acceleration of the point M of the medium, the elementary
tetrahedron will be in equilibrium under the action of the complete system
of forces. Hence

$$-\mathbf{T}\,dS + \boldsymbol{\Theta}_{23}\,dS^{23} + \boldsymbol{\Theta}_{31}\,dS^{31} + \boldsymbol{\Theta}_{12}\,dS^{12} + (\mathbf{f} - \rho\boldsymbol{\gamma})\,dV = 0,$$

and dividing by dS,

$$\mathbf{T} = \alpha_1\boldsymbol{\Theta}_{23} + \alpha_2\boldsymbol{\Theta}_{31} + \alpha_3\boldsymbol{\Theta}_{12} + (\mathbf{f} - \rho\boldsymbol{\gamma})\,\frac{dV}{dS}.$$



Suppose that the lengths of the edges of the tetrahedron tend to
zero, then dV/dS tends to zero, and in the limit we have

$$\mathbf{T} = \alpha_1\boldsymbol{\Theta}_{23} + \alpha_2\boldsymbol{\Theta}_{31} + \alpha_3\boldsymbol{\Theta}_{12}. \qquad (101.1)$$

We see that, if $dS^{23}$, $dS^{31}$, $dS^{12}$ represent the components of the
bivector† defining the area dS, then

$$\mathbf{T}\,dS = \boldsymbol{\Theta}_{23}\,dS^{23} + \boldsymbol{\Theta}_{31}\,dS^{31} + \boldsymbol{\Theta}_{12}\,dS^{12}.$$

Hence, in our coordinate system $x^i$, the components of the force TdS
are linear functions of the components of the bivector defining the
surface element dS. This result is obviously independent of the
particular coordinate system used.
Let us now refer the space to an arbitrary system of curvilinear
coordinates $y^i$. The contravariant components of the bivector defining
the element dS are denoted by $dS^{ij}$ and the contravariant components
of the force TdS are denoted by $T^i\,dS$. According to the above result
there exists a set of quantities $\Theta^i{}_{jk}$ such that

$$T^i\,dS = \tfrac12\,\Theta^i{}_{jk}\,dS^{jk}.$$

In place of the bivector $dS^{ij}$ it is convenient to introduce the adjoint‡
vector whose covariant components $d\sigma_k$ are defined by

$$dS^{ij} = \eta^{ijk}\,d\sigma_k. \qquad (101.2)$$

It can immediately be verified that $(d\sigma_k)$ is orthogonal to the surface
element dS and has the same magnitude. It follows that

$$T^i\,dS = \tfrac12\,\Theta^i{}_{jk}\,\eta^{jkl}\,d\sigma_l,$$

and writing

$$t^{il} = \tfrac12\,\Theta^i{}_{jk}\,\eta^{jkl},$$

we obtain the fundamental relations

$$T^i\,dS = t^{il}\,d\sigma_l. \qquad (101.3)$$

It is clear that the quantities $t^{il}$ are the contravariant components
of a tensor since, whatever the numerical values of the covariant
components $d\sigma_l$, the left-hand side of (101.3) is a contravariant vector.

† Defined in §47. (T.)   ‡ Defined in §52.
The tensor t so defined† is known as the stress tensor of the
continuous medium.
When the medium is a perfect fluid the stress tensor takes the form

$$t^{kl} = p\,g^{kl}, \qquad (101.4)$$

where p is the scalar pressure in the fluid at the point and time
instant considered and where the $g^{kl}$ are the components of the
fundamental tensor of the space.
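
Substituting (101.4) into (101.3) shows, as a short check, why p deserves the name pressure. Here $d\sigma^k = g^{kl}\,d\sigma_l$ denotes the contravariant components of the adjoint vector; this notation is assumed for the sketch.

```latex
\begin{align*}
T^k\,dS = t^{kl}\,d\sigma_l = p\,g^{kl}\,d\sigma_l = p\,d\sigma^k .
\end{align*}
```

The force on the element is thus directed along the normal to dS and has magnitude p dS, whatever the orientation of the element.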


102. General dynamical equations for continuous media. In the follow- 
ing we shall assume that all the quantities introduced are both con- 
tinuous and differentiable. 

Consider again an arbitrary closed surface S within the medium 
containing a region of volume V and write down the usual conditions 
for the resultant of the exterior forces and the forces of inertia acting 
on this region of the medium to vanish. It is convenient, particularly 
for writing down the moment conditions, to use an orthogonal 
rectilinear system of coordinates $x^i$.

Let $\gamma^i$ be the components of the acceleration of the centre of mass
$M(x^i)$ of an element of volume of V; the corresponding inertial force
acting on the element has components $-\rho\gamma^i\,dV$. If TdS represents the
surface force exerted by the elements interior to S on the elements
exterior to S it follows from the vanishing of the resultant force on V that

$$\iiint_V (f^i - \rho\gamma^i)\,dV - \iint_S t^{ik}\,d\sigma_k = 0 \qquad (102.1)$$

and, since the moment of the resultant couple also vanishes,

$$\iiint_V \left[x^i(f^j - \rho\gamma^j) - x^j(f^i - \rho\gamma^i)\right]dV - \iint_S \left(x^i t^{jk} - x^j t^{ik}\right)d\sigma_k = 0. \qquad (102.2)$$

Using Green’s theorem to transform the surface integral appearing
in (102.1) we obtain

$$\iiint_V (f^i - \rho\gamma^i - \partial_k t^{ik})\,dV = 0 \qquad (102.3)$$


† Note that in many English texts the stress tensor is defined with the
opposite sign. (T.) 
and (102.2) becomes

$$\iiint_V \left[x^i(f^j - \rho\gamma^j - \partial_k t^{jk}) - x^j(f^i - \rho\gamma^i - \partial_k t^{ik})\right]dV - \iiint_V (t^{ji} - t^{ij})\,dV = 0. \qquad (102.4)$$

Since (102.3) holds for any volume V it follows from the continuity
of the integrand that this is identically zero, hence

$$\rho\gamma^i = f^i - \partial_k t^{ik}. \qquad (102.5)$$

Using this relation (102.4) reduces to

$$\iiint_V (t^{ij} - t^{ji})\,dV = 0,$$

for an arbitrary volume V so that

$$t^{ij} - t^{ji} = 0. \qquad (102.6)$$

Equations (102.5) and (102.6) have been derived from the laws of
classical mechanics. The first set comprise the general dynamical
relations for continuous media, the second merely state that the tensor
$t^{ij}$ is symmetric with respect to its two indices.
It is easy to express the dynamical equations for continuous media
using an arbitrary curvilinear coordinate system. The equations

$$\rho\gamma^i = f^i - \nabla_k t^{ik} \qquad (102.7)$$

are invariant with respect to any change of coordinate system; they
also reduce to the equations obtained previously when using
rectangular axes. They are therefore the desired generalization of (102.5).
We note that to any stress tensor $t^{ij}$ there corresponds a force density
per unit volume given by

$$K^i = \nabla_k t^{ik}. \qquad (102.8)$$
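
For a perfect fluid, inserting the stress tensor (101.4) into the general equations gives a familiar special case. The sketch assumes only Ricci’s theorem in the form $\nabla_k g^{ik} = 0$.

```latex
\begin{align*}
\nabla_k t^{ik} &= \nabla_k\!\left(p\,g^{ik}\right) = g^{ik}\,\partial_k p ,\\
\rho\gamma^i &= f^i - g^{ik}\,\partial_k p .
\end{align*}
```

In vector language this reads $\rho\boldsymbol{\gamma} = \mathbf{f} - \operatorname{grad} p$, which is Euler’s equation of hydrodynamics.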


103. Alternative form of the equations for continuous media. Equations
(102.7) can be put into an interesting alternative form by writing out
the acceleration components $\gamma^i$ explicitly. These components are
simply the absolute total derivatives with respect to time of the
velocity vector v. Using (98.2), we have

$$\gamma^i = \frac{\nabla v^i}{dt} = \frac{\partial v^i}{\partial t} + \Gamma_j{}^i{}_k\,v^j v^k + v^k\,\partial_k v^i,$$

or

$$\gamma^i = \frac{\partial v^i}{\partial t} + v^k\,\nabla_k v^i. \qquad (103.1)$$

We deduce that

$$\rho\gamma^i = \frac{\partial(\rho v^i)}{\partial t} + \nabla_k(\rho v^k v^i) - v^i\left[\frac{\partial\rho}{\partial t} + \nabla_k(\rho v^k)\right],$$

and, since the bracketed term is zero by virtue of the equation of
continuity,

$$\rho\gamma^i = \frac{\partial(\rho v^i)}{\partial t} + \nabla_k(\rho v^k v^i).$$

We thus obtain a new formulation for the general dynamical
equations for continuous media:

$$\frac{\partial(\rho v^i)}{\partial t} + \nabla_k\left(\rho v^k v^i + t^{ik}\right) = f^i. \qquad (103.2)$$

It was this formulation which suggested some of the postulates of
general relativity to Einstein.

The three equations (103.2) together with the equation of
continuity determine the behaviour of continuous media under the action
of surface and other forces. The mechanical problem is, of course,
only defined when the $t^{ik}$ and the $f^i$ are known. The body forces are
determined by external circumstances such as the presence of a
gravitational field, whilst the stresses depend upon the internal
deformations of the medium and on the flux of matter. In the case of a
fluid, for example,

$$t_{ik} = p\,g_{ik},$$

where p is itself a function of $\rho$ and temperature given by the
equation of state of the fluid.

The above equations can be used to derive an equation representing 
energy transfer which expresses the fact that the energy contained in 
an element of volume varies with the flux of energy passing through 
the surface of the element. 


CHAPTER VII 


Special Relativity and Maxwell’s Equations 


I. PHYSICAL PRINCIPLES 


104. The Michelson-Morley experiment. A systematic treatment of
special relativity theory would take us beyond the scope of this book. 
In this chapter we merely propose to indicate the bases of the theory 
and to show the role played by tensor methods, particularly in the 
theory of electromagnetism. 

Historically, the theory of special relativity was a consequence of 
the negative result of the Michelson-Morley experiment. Various 
experimental facts had pointed to the existence of an ether at absolute 
rest, which did not affect the motion of ordinary matter, and which 
was the medium in which electromagnetic waves were propagated. 
Among these facts was Fizeau’s experiment on the velocity of pro- 
pagation of light in a material medium and the phenomenon of 
aberration of light waves. 

This picture of a stationary ether seemed to make it inevitable that 
the value of the speed of light measured by an observer in motion with 
respect to the ether would depend upon the magnitude and direction 
of his velocity. If c is the velocity of light and v is the collinear velocity 
of the observer with respect to the ether then, according to classical 
kinematics, the velocity measured by the observer would be (c − v) or
(c + v) according to whether he moves in the same or the opposite
direction as the light. An observer ignorant of his velocity relative to 
the ether would be able to determine this experimentally by emitting 
light signals in all directions and measuring the time taken by these 
signals to reach the surface of a sphere centred at the light source. 
Any motion with respect to the ether would create an ‘ether wind’ 
which would affect the signals in such a way that the first one to reach 
the sphere would have travelled along the direction opposite to the 
motion and the last one would have travelled in the same direction 
as the motion. 

This was the principle of the celebrated Michelson-Morley
experiment which attempted to find the motion of the earth with respect to
the ether. The velocity of the earth in its orbit is about 30 km/sec. 
with respect to the Copernican reference frame which has its origin at 
the centre of gravity of the solar system. After six months this velocity 
is reversed. It might be that, at a certain instant, the unknown motion 
of the Copernican axes with respect to the ether exactly cancels the 
relative motion of the earth but this could hardly continue throughout 
the year. 

Using a well-known interferometer whose description we shall
omit, Michelson and Morley could have detected an ether drift of only
1.5 km/sec. In fact they observed no fringe shift within the limits of
experimental accuracy and the same experiment done at different
periods of the year always gave the same negative result. More recent
experiments† have confirmed the original result.

Experiment thus shows that the speed of light is independent of the 
motion of the observer. 

Various artificial hypotheses (the ether drag theory, Ritz’ theory of 
the dependence of the speed of light on the motion of the source) were 
put forward to explain this negative result. These were subsequently 
vitiated by further experimental facts. However, it was the hypothesis 
of the contraction of bodies in motion in the quantitative form put 
forward by Fitzgerald which led physicists, notably Lorentz and 
Einstein, to formulate the special theory of relativity. 


105. Constancy of the speed of light. Lorentz and Einstein took the 
results of the Michelson-Morley experiment as their starting point. 
Since a Galilean frame is a system of axes moving uniformly along a 
straight line with respect to the Copernican frame the result of this 
experiment can be stated as follows: the speed of light is the same with 
respect to all Galilean frames which coincide for short periods with 
the system of axes fixed on the earth as it describes its orbit. 

† The more recent experiments confirming the Michelson-Morley result
were by Kennedy (1926), Piccard and Stahel (1926-8), Joos (1930).
Indirect confirmation was given by the observations of Ives (1938)
on the Doppler effect.

We are thus led to postulate the principle of the constancy of the
speed of light:

The speed of light in vacuo measured with respect to any Galilean
frame is the same in all directions; this speed, of the order of 300,000
km/sec, will be denoted by c.

There might be disadvantages in basing a principle of such general- 
ity on the result of a single type of experiment without the confirma- 
tion of other types of experiment. However, the Michelson-Morley 
experiment merely served to draw the attention of physicists to a 
mathematical fact which had, until then, remained obscure. It had 
already been pointed out by Poincaré that the equations of Newtonian 
dynamics and Maxwell’s electromagnetic equations are not invariant 
under the same group of transformations. An intrinsic conflict there- 
fore exists between Newtonian dynamics and electromagnetic theory. 

In order to resolve this conflict Einstein proposed to admit the 
principle of the constancy of the speed of electromagnetic waves, 
thus retaining the Maxwell theory, and to modify Newtonian 
dynamics accordingly. 


106. Newton’s and Einstein’s principles of relativity. In order to 
improve our understanding of the above-mentioned conflict let us 
return to Newtonian mechanics. Consider any two Galilean frames 
whose relative velocity is represented by v. We can arrange, by a 
suitable choice of coordinates, that the axis O'x' of the second frame
slides along the axis Ox of the first, the axes O'y' and O'z' remaining
parallel respectively to the axes Oy and Oz of the first frame. Using
classical kinematics, we have that, if (x, y, z, t) and (x', y', z', t') denote
the coordinates and time of the same event with respect to the two
Galilean frames, then

$$ x' = x - vt, \quad y' = y, \quad z' = z, \quad t' = t, \tag{106.1} $$

where we have chosen the same origin for t and t'. Combining
(106.1) with transformations which consist of pure spatial translations
and rotations combined with changes of the time origin we obtain the
most general transformation in classical mechanics, relating the
coordinates and time of an event in a Galilean frame to the coordinates
and time of the same event in any other Galilean frame. These
transformations form a group which is known as the classical Galilean
group.

In Newtonian dynamics the various material points $M_1$, $M_2$ are
subject to forces which are derivable from a potential function which
depends only on the relative distances $r_{12}$, $r_{13}$ etc. of the points at the
same instant of time. If $\boldsymbol{\gamma}_1$ denotes the acceleration of $M_1$ with respect
to a Galilean frame the corresponding equation of motion of $M_1$ may
be written

$$ m_1 \boldsymbol{\gamma}_1 = \operatorname{grad}_{M_1} \Phi, \ \text{etc.}; \quad \Phi = \Phi(r_{12}, r_{13}, \text{etc.}). $$

Changing to another Galilean frame by means of a transformation
in the classical Galilean group we see immediately that

$$ \boldsymbol{\gamma}_1' = \boldsymbol{\gamma}_1, \ \text{etc.}, \quad r_{12}' = r_{12}, \ \text{etc.}, $$

and in the new frame the equations of motion become

$$ m_1 \boldsymbol{\gamma}_1' = \operatorname{grad}_{M_1'} \Phi', \ \text{etc.}; \quad \Phi' = \Phi(r_{12}', r_{13}', \text{etc.}). $$


The Newtonian equations of motion are thus invariant with respect 
to the classical Galilean group. We conclude that, according to 
Newtonian mechanics, no purely mechanical experiment carried out 
within a Galilean system can determine the motion of this Galilean 
system with respect to any other Galilean system. 

This constitutes Newton's principle of relativity. This principle does 
not include electromagnetic phenomena since Maxwell’s equations 
are not invariant with respect to the classical Galilean group. 
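
The two properties used in this argument, the invariance of relative distances and the group character of the transformations, can be checked numerically for the boost (106.1). This is only an illustrative sketch; the function names and the sample events are assumptions made for the example.

```python
import math

def galilean_boost(event, v):
    """Apply (106.1) to an event (x, y, z, t): x' = x - v t, the rest unchanged."""
    x, y, z, t = event
    return (x - v * t, y, z, t)

def distance(e1, e2):
    """Spatial distance between two events taken at the same instant."""
    return math.dist(e1[:3], e2[:3])

# Two simultaneous events (t = 5) with hypothetical coordinates.
e1 = (1.0, 2.0, 3.0, 5.0)
e2 = (4.0, 6.0, 3.0, 5.0)

# The relative distance r12 is unchanged by the boost.
print(distance(e1, e2), distance(galilean_boost(e1, 7.0), galilean_boost(e2, 7.0)))

# Two successive boosts are equivalent to a single boost with the summed velocity.
print(galilean_boost(galilean_boost(e1, 2.0), 3.0), galilean_boost(e1, 5.0))
```

The second print shows the classical composition law of velocities as a property of the group.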

The Michelson-Morley experiment and the principle of the con- 
stancy of the speed of light led to the formulation of Einstein’s 
principle of relativity: 

No physical experiment — mechanical or electromagnetic — carried 
out within a Galilean system can be used to demonstrate the motion of 
this Galilean system with respect to any other Galilean system. 

The idea of an ether at absolute rest thus loses all significance and is 
eliminated. All the equations of physics, referred to Galilean frames, 
must be invariant with respect to certain transformations which, in 
particular, leave Maxwell’s equations invariant. These transformations
form a group, known as the Lorentz group, which we now propose
to study.


II. THE LORENTZ GROUP AND
MINKOWSKI SPACE-TIME 


107. Space-time. When we were studying dynamical systems with 
time-dependent constraints in the previous Chapter it was found convenient
to use the configuration space-time $V_{n+1}$. Similarly, in order
to represent mechanical and electromagnetic phenomena taking
place in the world, it is natural to introduce a continuum $V_4$ having
four dimensions, three spatial and one temporal, each point of which
corresponds to an event. This continuum is called space-time.

The space-time continuum can be referred to arbitrary systems of
curvilinear coordinates. If $(y^1, y^2, y^3, y^4)$ denotes one such coordinate
system it is possible to make coordinate transformations of the type

$$ y'^{\alpha} = f^{\alpha}(y^1, y^2, y^3, y^4) \quad (\alpha = 1, 2, 3, 4), \tag{107.1} $$

where the $f^{\alpha}$ are restricted only to the extent of the usual conditions
of continuity, of single-valuedness and of differentiability.

In special relativity theory we assume that it is possible to refer at
least part of this continuum to a Galilean coordinate system defined
in the following way: let (x, y, z) be the coordinates of a point in space
with respect to a rectangular Galilean system of axes. The principle of
the constancy of the speed of light provides us with a natural time
scale which allows the definition of a time t associated with an event
occurring at the point (x, y, z). The coordinates (x, y, z, t) are said to be
the Galilean coordinates of $V_4$. We propose to find the transformation
formulae

$$ \begin{aligned}
x &= x(x', y', z', t'), & x' &= x'(x, y, z, t), \\
y &= y(x', y', z', t'), & y' &= y'(x, y, z, t), \\
z &= z(x', y', z', t'), & z' &= z'(x, y, z, t), \\
t &= t(x', y', z', t'), & t' &= t'(x, y, z, t),
\end{aligned} \tag{107.2} $$


which enable us to pass from one system of Galilean coordinates
(x, y, z, t) to another (x', y', z', t').


108. The Lorentz group. We now consider two Galilean coordinate
systems and express the infinitesimal displacement of an electromagnetic
wave with respect to each of them.

In terms of the first system of rectangular coordinates, the spatial
line element is given by

$$ d\sigma^2 = dx^2 + dy^2 + dz^2. $$

Consider, in this system, two neighbouring events (x, y, z, t) and
(x + dx, y + dy, z + dz, t + dt). If these two events define the infinitesimal
displacement of an electromagnetic wave we have, according to the
principle of the constancy of the speed of light,

$$ \frac{d\sigma^2}{dt^2} = c^2. $$

Hence, for any infinitesimal displacement of an electromagnetic wave,
the expression

$$ ds^2 = c^2 dt^2 - d\sigma^2 = c^2 dt^2 - dx^2 - dy^2 - dz^2 \tag{108.1} $$

is zero. The same expression, evaluated in the second coordinate
system,

$$ ds'^2 = c^2 dt'^2 - d\sigma'^2 = c^2 dt'^2 - dx'^2 - dy'^2 - dz'^2 $$

is also zero. Consider now an arbitrary infinitesimal displacement at a
point M in the two systems. A reasonable generalization of the
relation $ds'^2 = 0 = ds^2$ is given by

$$ ds'^2 = f(M)\, ds^2, $$

where f(M) is a function of the coordinates of M and may also depend
on the relative velocity v of the two coordinate systems. More
explicitly

$$ c^2 dt'^2 - dx'^2 - dy'^2 - dz'^2 = f(x, y, z, t)\,(c^2 dt^2 - dx^2 - dy^2 - dz^2). \tag{108.3} $$

When the relative velocity of the two Galilean systems is zero the
function f(M) must reduce to unity. It can be shown that the only
physically significant solutions of (108.3) are those in which f(M) is
always unity. Equation (108.3) therefore becomes

$$ c^2 dt'^2 - dx'^2 - dy'^2 - dz'^2 = c^2 dt^2 - dx^2 - dy^2 - dz^2. \tag{108.4} $$

We are thus led to consider all coordinate transformations (107.2)
which satisfy the condition (108.4) and therefore leave invariant the 
differential quadratic form defined by (108.1). We also require them 
to reduce to a transformation consisting of a pure spatial displace- 
ment and a change of time origin when v=0. It is immediately 
obvious that these transformations form a group: if any two transfor- 
mations (107.2) leave the quadratic form (108.1) invariant, so does 
their product; the inverse of such a transformation also leaves (108.1) 
invariant. This group of transformations is known as the Lorentz 
group. 

The determination of the transformations of the Lorentz group is a 
simple algebraic problem, the solution of which is outlined below. A 
geometrical interpretation of the Lorentz group can also easily be 
obtained. Put 


$$ u = ict, \quad u' = ict', \quad (i^2 = -1), \tag{108.5} $$

then (108.4) takes the form

$$ dx'^2 + dy'^2 + dz'^2 + du'^2 = dx^2 + dy^2 + dz^2 + du^2. \tag{108.6} $$


The Lorentz group thus corresponds to the group of translations and 
rotations in a proper Euclidean space of four dimensions. This can be 
obtained by combining the translations 


$$ x' = x + a, \quad y' = y + b, \quad z' = z + c, \quad u' = u + d, $$

with the linear orthogonal transformations (rotations) of the four
coordinates (x, y, z, u).

According to its definition the Lorentz group contains transforma- 
tions which consist of purely spatial displacements (which conserve 
the form dx*+dy*+dz*) and changes of the time origin. We now 
seek a particular transformation based on the trivial solutions already 
obtained which generates the most general transformation of the 
Lorentz group. 

To this end we note that by adding a constant to each of the four
variables, which is equivalent to a translation of the axes Oxyz and a
change in the origin of the time t, the resulting functions (107.2),
which are necessarily linear, can also be made homogeneous. This
implies that the origin of the first frame at time t = 0 coincides with
that of the second frame at t' = 0. In a rectilinear translation of the
second frame with respect to the first, a straight line fixed in the
second frame and parallel to the relative velocity slides along a
straight line fixed in the first frame. By a suitable choice of the spatial
coordinates we can arrange that the O'x' axis slides along Ox in the
course of the motion, the directions of these two axes then being that
of the relative velocity. Rotating O'x'y'z' about O'x' we can make
O'y' and O'z' parallel and in the same sense as Oy and Oz respectively.
This means that Oxyz at time t = 0 coincides with O'x'y'z' at time
t' = 0.

Returning to the four-dimensional Euclidean space referred to the 
orthonormal frames with coordinates (x, y,z,u) and (x’,y’,z’,u’), we 
are thus led to study rotations which simultaneously leave invariant 
the plane defined by y=0 or y’ = 0 and the one defined by z= 0 or 
z’ = 0; that is to say they leave invariant the orthogonal coordinate 
axes corresponding to the variables z and y. We conclude, as in the 
three-dimensional case, that 


$$ y' = y, \quad z' = z. \tag{108.7} $$

The identity (108.6) now reduces to

$$ dx'^2 + du'^2 = dx^2 + du^2, \tag{108.8} $$

and the transformation only concerns the two variables x and u.
Since the origins of the coordinates (x, u) and (x', u') coincide, the
identity (108.8) defines ordinary rotations in a plane

$$ x' = x\cos\phi + u\sin\phi, \quad u' = -x\sin\phi + u\cos\phi, \tag{108.9} $$

where $\phi$ denotes an arbitrary angular parameter. Since u and u' are
pure imaginary quantities, $\sin\phi$ is pure imaginary and $\cos\phi$ is real.
As $\tan\phi$ is pure imaginary and dimensionless we write

$$ \tan\phi = \frac{iv}{c} $$

and substitute for $\phi$ the real parameter v having the dimensions of
velocity. It follows that

$$ \cos\phi = \frac{1}{\sqrt{1 - v^2/c^2}}, \quad \sin\phi = \frac{i(v/c)}{\sqrt{1 - v^2/c^2}}. \tag{108.10} $$

Substituting these expressions into (108.9) and reintroducing the
variables t and t', we obtain

$$ x' = \frac{x - vt}{\sqrt{1 - v^2/c^2}}, \quad t' = \frac{t - vx/c^2}{\sqrt{1 - v^2/c^2}}. $$

We have thus established that the most general transformation of the
Lorentz group is obtained by combining the trivial transformations (pure
spatial rotations and translations) with the special transformation

$$ x' = \frac{x - vt}{\sqrt{1 - \beta^2}}, \quad y' = y, \quad z' = z, \quad t' = \frac{t - \beta(x/c)}{\sqrt{1 - \beta^2}}. \tag{108.11} $$

This is known as the special Lorentz transformation.
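
The passage from the rotation (108.9) with an imaginary angle to the real formulae for x' and t' can be checked with complex arithmetic. The sketch below is only an illustration: the function name, the choice of units with c = 1 and the sample values are assumptions made for the example.

```python
import cmath
import math

def lorentz_via_rotation(x, t, v, c=1.0):
    """Rotate the pair (x, u = i c t) through the angle phi defined by
    tan(phi) = i v / c, as in (108.9), and return the real pair (x', t')."""
    phi = cmath.atan(1j * v / c)          # purely imaginary angle for |v| < c
    u = 1j * c * t
    x_new = x * cmath.cos(phi) + u * cmath.sin(phi)
    u_new = -x * cmath.sin(phi) + u * cmath.cos(phi)
    t_new = u_new / (1j * c)              # recover t' from u' = i c t'
    return x_new.real, t_new.real

# Compare with the closed form (x - v t)/sqrt(1 - v^2/c^2), etc.
x, t, v = 2.0, 3.0, 0.6
gamma = 1.0 / math.sqrt(1.0 - v**2)
print(lorentz_via_rotation(x, t, v))
print(gamma * (x - v * t), gamma * (t - v * x))
```

Both lines print the same pair of numbers up to rounding, which illustrates that the imaginary rotation and the real transformation are the same operation.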

It is obvious from (108.11) that v, which for physical reasons is
necessarily smaller than c in absolute value, is such that every point
fixed in the second Galilean frame (the origin, for example) is moving
relative to the first frame with uniform velocity v parallel to Ox. We
note that if β = v/c becomes negligibly small (108.11) reduces to
(106.1) which was used to define the classical Galilean group. When
v = 0 the transformation (108.11) naturally reduces to the identity
transformation. The inverse formulae to those in (108.11) are given
by

$$ x = \frac{x' + vt'}{\sqrt{1 - \beta^2}}, \quad y = y', \quad z = z', \quad t = \frac{t' + \beta(x'/c)}{\sqrt{1 - \beta^2}}, \tag{108.12} $$

which only differ by the interchange of variables and the replacement
of v by -v.
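
The properties just stated for (108.11) and (108.12) lend themselves to a direct numerical check. The following sketch assumes an approximate numerical value for c and uses hypothetical sample events; the function names are illustrative and not taken from the text.

```python
import math

C = 299_792.458  # km/sec, approximate value assumed for the example

def lorentz(event, v, c=C):
    """Special Lorentz transformation (108.11) of an event (x, y, z, t)."""
    x, y, z, t = event
    beta = v / c
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    return (gamma * (x - v * t), y, z, gamma * (t - beta * x / c))

def lorentz_inverse(event, v, c=C):
    """Inverse formulae (108.12): the same transformation with v replaced by -v."""
    return lorentz(event, -v, c)

def interval_squared(e1, e2, c=C):
    """c^2 dt^2 - dx^2 - dy^2 - dz^2 for a pair of events, as in (108.1)."""
    dx, dy, dz, dt = (b - a for a, b in zip(e1, e2))
    return c**2 * dt**2 - dx**2 - dy**2 - dz**2

# Invariance of the quadratic form, in units where c = 1.
a, b = (0.0, 0.0, 0.0, 0.0), (1.0, 2.0, 3.0, 4.0)
print(interval_squared(a, b, 1.0),
      interval_squared(lorentz(a, 0.6, 1.0), lorentz(b, 0.6, 1.0), 1.0))

# Galilean limit: for v = 30 km/sec the result is very close to (x - v t, y, z, t).
print(lorentz((1000.0, 0.0, 0.0, 10.0), 30.0))
```

The first print shows the same value of the interval before and after the transformation; the second shows how small the departure from (106.1) is at the orbital speed of the earth.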


It would take rather long to verify directly that Maxwell’s equa- 
tions, in their usual form, are invariant with respect to transformations 
of the Lorentz group. This result is, however, a trivial consequence of
the simple tensor form into which we shall now put these equations.


109. Lorentz transformation in vector form. It is convenient to
rewrite the special Lorentz transformation in three-dimensional
vector notation.

We denote the velocity of the second frame with respect to the first
by $\mathbf{v}$ and the unit vector along Ox (and in the direction of $\mathbf{v}$) by $\mathbf{i}$,
hence

$$ \mathbf{v} = v\,\mathbf{i}. $$


Let $\mathbf{r}$ represent the position vector defining the point (x, y, z) with
respect to the origin O of the first frame and $\mathbf{r}'$ represent the analogous
quantity in the second frame. The vectors $\mathbf{r}$ and $\mathbf{r}'$ may be written as

$$ \mathbf{r} = x\,\mathbf{i} + \mathbf{r}_{\perp}, \quad \mathbf{r}' = x'\,\mathbf{i} + \mathbf{r}'_{\perp}, \tag{109.1} $$

where $\mathbf{r}_{\perp}$ and $\mathbf{r}'_{\perp}$ represent the components of $\mathbf{r}$ and $\mathbf{r}'$ perpendicular to
$\mathbf{i}$. Using the Lorentz transformation relations (108.12) we have

$$ \mathbf{r}_{\perp} = \mathbf{r}'_{\perp}. $$

We now wish to evaluate $\mathbf{r}$ in terms of $\mathbf{r}'$, $\mathbf{v}$ and t'. Combining
equations (109.1) we have

$$ \mathbf{r} = \mathbf{r}' + (x - x')\,\mathbf{i}. \tag{109.2} $$

However

$$ x - x' = \frac{x' + vt'}{\sqrt{1 - \beta^2}} - x' = \left(\frac{1}{\sqrt{1 - \beta^2}} - 1\right)x' + \frac{vt'}{\sqrt{1 - \beta^2}} $$

and

$$ x' = \frac{\mathbf{r}'\cdot\mathbf{v}}{v}. $$

Substituting these results into (109.2) we get the transformation
formula for position vectors:

$$ \mathbf{r} = \mathbf{r}' + \left(\frac{1}{\sqrt{1 - \beta^2}} - 1\right)\frac{(\mathbf{r}'\cdot\mathbf{v})}{v^2}\,\mathbf{v} + \frac{\mathbf{v}\,t'}{\sqrt{1 - \beta^2}}. \tag{109.3} $$

It is apparent that the three vectors $\mathbf{r}$, $\mathbf{r}'$ and $\mathbf{v}$ are always coplanar.

Using (108.12) the time transformation formula can obviously be
written

$$ t = \frac{t' + (\mathbf{r}'\cdot\mathbf{v})/c^2}{\sqrt{1 - \beta^2}}. \tag{109.4} $$

The transformation equations in their vector form (109.3) and
(109.4) are useful in a great number of problems in special relativity.
We shall now use them to establish Einstein's formula for the
composition of velocities.
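
As a check on the vector formulae (109.3) and (109.4), the following sketch evaluates them for a relative velocity that is not along a coordinate axis and verifies that the interval from the common origin is preserved. Plain tuples are used for vectors; the function names, the units (c = 1) and the sample values are assumptions made for the illustration.

```python
import math

def dot(a, b):
    """Scalar product of two 3-vectors given as tuples."""
    return sum(p * q for p, q in zip(a, b))

def lorentz_vector(r_prime, t_prime, v, c=1.0):
    """Return (r, t) from (r', t') using (109.3) and (109.4);
    v is the velocity of the second frame relative to the first."""
    v2 = dot(v, v)
    if v2 == 0.0:
        return tuple(r_prime), t_prime
    gamma = 1.0 / math.sqrt(1.0 - v2 / c**2)
    rv = dot(r_prime, v)
    # Coefficient of v in (109.3): (gamma - 1)(r'.v)/v^2 + gamma t'
    coeff = (gamma - 1.0) * rv / v2 + gamma * t_prime
    r = tuple(rp + coeff * vi for rp, vi in zip(r_prime, v))
    t = gamma * (t_prime + rv / c**2)
    return r, t

# Velocity along Ox: the result agrees with (108.12).
print(lorentz_vector((2.0, 5.0, 7.0), 3.0, (0.6, 0.0, 0.0)))

# Oblique velocity: c^2 t^2 - r.r is the same in both frames.
r, t = lorentz_vector((1.0, 2.0, 3.0), 4.0, (0.3, 0.4, 0.0))
print(t**2 - dot(r, r), 4.0**2 - dot((1.0, 2.0, 3.0), (1.0, 2.0, 3.0)))
```

The perpendicular components of the position vector are left unchanged because the added term is always parallel to v.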


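110. Relativistic composition of velocities. The composition of
velocities in classical kinematics is a simple matter: if $\mathbf{v}$ denotes the
velocity of a Galilean system S' with respect to S and if a material
point P moves with velocity $\mathbf{u}'$ relative to S' then its velocity relative
to S is given by

$$ \mathbf{u} = \mathbf{v} + \mathbf{u}'. \tag{110.1} $$

In special relativity the relation between $\mathbf{u}$, $\mathbf{v}$ and $\mathbf{u}'$ is much more
complicated. Using the notation of §109 the velocities of P with
respect to the two Galilean systems are given by

$$ \mathbf{u} = \frac{d\mathbf{r}}{dt}, \quad \mathbf{u}' = \frac{d\mathbf{r}'}{dt'}. $$

Differentiating (109.3) with respect to t we obtain

$$ \frac{d\mathbf{r}}{dt} = \left[\mathbf{u}' + \left(\frac{1}{\sqrt{1 - \beta^2}} - 1\right)\frac{(\mathbf{u}'\cdot\mathbf{v})}{v^2}\,\mathbf{v} + \frac{\mathbf{v}}{\sqrt{1 - \beta^2}}\right]\frac{dt'}{dt} \tag{110.2} $$

and differentiating (109.4) with respect to t' gives

$$ \frac{dt}{dt'} = \frac{1 + (\mathbf{u}'\cdot\mathbf{v})/c^2}{\sqrt{1 - \beta^2}}. $$

Substituting this expression for dt/dt' in (110.2) we find

$$ \mathbf{u} = \frac{1}{1 + (\mathbf{u}'\cdot\mathbf{v}/c^2)}\left[\sqrt{1 - \beta^2}\,\mathbf{u}' + \left(1 - \sqrt{1 - \beta^2}\right)\frac{(\mathbf{u}'\cdot\mathbf{v})}{v^2}\,\mathbf{v} + \mathbf{v}\right], $$

which shows that the three vectors $\mathbf{u}$, $\mathbf{u}'$ and $\mathbf{v}$ are coplanar. This
expression is easily transformed into the formula for the composition
of velocities, giving

$$ \mathbf{u} = \frac{1}{1 + (\mathbf{u}'\cdot\mathbf{v}/c^2)}\left\{\left(1 + \frac{\mathbf{u}'\cdot\mathbf{v}}{v^2}\right)\mathbf{v} + \sqrt{1 - \beta^2}\left[\mathbf{u}' - \frac{(\mathbf{u}'\cdot\mathbf{v})}{v^2}\,\mathbf{v}\right]\right\} \tag{110.3} $$

in which the numerator is the sum of two vectors, one parallel and one
perpendicular to $\mathbf{v}$. If β = v/c is negligible compared to unity, equation
(110.3) has the limiting form of the classical relation (110.1). An
immediate consequence of (110.3) is the expression for the square of u

$$ u^2 = \frac{1}{\left[1 + (\mathbf{u}'\cdot\mathbf{v}/c^2)\right]^2}\left\{u'^2 + v^2 + 2\,\mathbf{u}'\cdot\mathbf{v} + \frac{(\mathbf{u}'\cdot\mathbf{v})^2 - u'^2 v^2}{c^2}\right\} \tag{110.4} $$

which is symmetrical in $\mathbf{u}'$ and $\mathbf{v}$.

When the vectors $\mathbf{u}'$ and $\mathbf{v}$ are collinear $\mathbf{u}$ is also in the same direction
and the second term in the numerator of (110.3) is zero. We then
have the Einstein relation between the algebraic values of the three
velocities

$$ u = \frac{u' + v}{1 + (u'v/c^2)}. \tag{110.5} $$

It is easily seen from (110.4) or (110.5) that it is impossible to exceed
the speed of light by combining two velocities less than or equal to c.
In particular putting u' = c in (110.4) gives

$$ u^2 = \frac{c^2 + 2\,\mathbf{u}'\cdot\mathbf{v} + (\mathbf{u}'\cdot\mathbf{v})^2/c^2}{\left[1 + (\mathbf{u}'\cdot\mathbf{v}/c^2)\right]^2} = c^2. $$

Thus, when u' = c, it always follows that u = c whatever the value of
$\mathbf{v}$. This result obviously agrees with the principle of the constancy of
the speed of light.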
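
The composition law can be explored numerically. The sketch below implements the expression obtained by substituting dt/dt' into (110.2), which is equivalent to (110.3); the function names, the units (c = 1) and the sample velocities are assumptions made for the illustration.

```python
import math

def dot(a, b):
    """Scalar product of two 3-vectors given as tuples."""
    return sum(p * q for p, q in zip(a, b))

def compose(u_prime, v, c=1.0):
    """Velocity u relative to S of a point moving with u' relative to S',
    where S' moves with velocity v relative to S."""
    v2 = dot(v, v)
    if v2 == 0.0:
        return tuple(u_prime)
    uv = dot(u_prime, v)
    root = math.sqrt(1.0 - v2 / c**2)
    denom = 1.0 + uv / c**2
    # Numerator: root * u' + (1 - root) (u'.v / v^2) v + v
    return tuple((root * up + ((1.0 - root) * uv / v2 + 1.0) * vi) / denom
                 for up, vi in zip(u_prime, v))

# Collinear case, equation (110.5): 0.5c combined with 0.5c gives 0.8c.
print(compose((0.5, 0.0, 0.0), (0.5, 0.0, 0.0)))

# A light signal (u' = c) perpendicular to v still has speed c in S.
print(math.hypot(*compose((0.0, 1.0, 0.0), (0.6, 0.0, 0.0))))
```

Note that the resulting vector depends on the order of the two velocities when they are not collinear, while its magnitude does not, in agreement with the symmetry of (110.4).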


111. Minkowski space-time. Equation (108.4) is equivalent to saying
that two neighbouring events (x, y, z, t) and (x + dx, y + dy, z + dz, t + dt)
define an expression

$$ ds^2 = c^2 dt^2 - dx^2 - dy^2 - dz^2, \tag{111.1} $$

which has the same value in all Galilean coordinate systems. We note
that, if the two events correspond to a displacement made with a
speed less than c, then $ds^2$ is positive.

We are thus led, in special relativity, to associate the continuum
$V_4$ with the metric defined by the quadratic form (111.1). Since this
metric has constant coefficients with respect to the Galilean variables
and a signature (+ - - -), it therefore defines $V_4$ as an improper
Euclidean space. This space is known as Minkowski space-time and
ds is said to be the infinitesimal element of a finite interval between two
events.

The Galilean coordinates constitute an orthogonal rectilinear
coordinate system for this space. We obtain an orthonormal frame in
$V_4$ if we substitute the variables

$$ x^1 = x, \quad x^2 = y, \quad x^3 = z, \quad x^4 = ct, \tag{111.2} $$

for (x, y, z, t). The metric (111.1) then takes the form

$$ ds^2 = (dx^4)^2 - (dx^1)^2 - (dx^2)^2 - (dx^3)^2. $$

The variables $x^{\alpha}$ are called reduced Galilean coordinates. We shall
make frequent use of them in what follows. It is convenient to
introduce the notation

$$ ds^2 = \eta_{\alpha\beta}\,dx^{\alpha}dx^{\beta} \quad (\alpha, \beta, \text{etc.} = 1, 2, 3, 4), \tag{111.3} $$

where

$$ \eta_{\alpha\beta} = 0 \ (\alpha \neq \beta), \quad \eta_{44} = -\eta_{11} = -\eta_{22} = -\eta_{33} = 1. $$

Naturally, Minkowski space-time can be referred to any system of
curvilinear coordinates. Its metric is then written as

$$ ds^2 = g_{\alpha\beta}\,dy^{\alpha}dy^{\beta}, $$

where the $g_{\alpha\beta}$ are functions of the curvilinear coordinates $y^{\alpha}$.
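
The metric in reduced Galilean coordinates can be written out explicitly and used to check that a special Lorentz transformation leaves the interval unchanged. This is a minimal sketch with zero-based list indices, so that index 3 stands for the coordinate $x^4$; the names `ETA`, `boost_matrix` and `apply` are illustrative assumptions.

```python
import math

# eta[a][b] of (111.3) for the coordinates (x^1, x^2, x^3, x^4 = c t).
ETA = [[-1, 0, 0, 0],
       [0, -1, 0, 0],
       [0, 0, -1, 0],
       [0, 0, 0, 1]]

def interval_squared(dx):
    """ds^2 = eta_ab dx^a dx^b for a displacement dx = [dx^1, dx^2, dx^3, dx^4]."""
    return sum(ETA[a][b] * dx[a] * dx[b] for a in range(4) for b in range(4))

def boost_matrix(beta):
    """Matrix of the special Lorentz transformation (108.11) acting on
    reduced Galilean coordinates, with beta = v / c."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    return [[g, 0, 0, -g * beta],
            [0, 1, 0, 0],
            [0, 0, 1, 0],
            [-g * beta, 0, 0, g]]

def apply(matrix, vec):
    """Product of a 4 x 4 matrix with a 4-component displacement."""
    return [sum(matrix[a][b] * vec[b] for b in range(4)) for a in range(4)]

dx = [1.0, 2.0, 3.0, 4.0]
print(interval_squared(dx), interval_squared(apply(boost_matrix(0.6), dx)))
print(interval_squared([3.0, 4.0, 0.0, 5.0]))  # a displacement along a light ray
```

A positive value corresponds to a displacement made with speed less than c, and a zero value to a displacement at the speed of light.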


III. DYNAMICS IN SPECIAL RELATIVITY


112. The unit velocity vector and the principle of inertia. Consider a
moving mass point P in Minkowski space-time whose velocity is, of
course, less than c. The motion of P can be defined by giving the
coordinates $y^{\alpha}$ as functions of a single parameter. We choose the
interval s measured along the trajectory of P in $V_4$. The vector

$$ u^{\alpha} = \frac{dy^{\alpha}}{ds} \quad (\alpha = 1, 2, 3, 4), \tag{112.1} $$

which is obviously of unit length, is called the unit velocity vector.

Let us express the components of this vector in terms of reduced
Galilean coordinates. The components of an ordinary spatial velocity
vector are

$$ v^i = \frac{dx^i}{dt} \quad (i, \text{etc.} = 1, 2, 3). \tag{112.2} $$

We conclude that

$$ u^i = \frac{dx^i}{ds} = v^i\,\frac{dt}{ds}, \quad u^4 = \frac{dx^4}{ds} = c\,\frac{dt}{ds}. $$

However, if v = βc denotes the magnitude of the velocity (112.2), we
can rearrange (111.1) to give

$$ ds = c\sqrt{1 - \beta^2}\,dt. \tag{112.3} $$

It follows that

$$ u^i = \frac{v^i}{c\sqrt{1 - \beta^2}}, \quad u^4 = \frac{1}{\sqrt{1 - \beta^2}}. \tag{112.4} $$

Let the motion of P in Galilean coordinates be uniform and
rectilinear. The components $v^i$, and consequently the $u^{\alpha}$, are then
constant. We therefore have

$$ \frac{du^i}{ds} = 0, \quad \frac{du^4}{ds} = 0. \tag{112.5} $$

The world line of P is therefore a geodesic of (111.1) with positive $ds^2$.
On the other hand the differential system describing the geodesics of
(111.1) may be written in Galilean variables as

$$ \frac{d^2x}{ds^2} = \frac{d^2y}{ds^2} = \frac{d^2z}{ds^2} = \frac{d^2t}{ds^2} = 0, $$

and each geodesic with positive $ds^2$ corresponds to a uniform rectilinear
motion with velocity less than c.
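
The statement that the vector with components (112.4) has unit length with respect to the metric (111.1) is easily verified numerically. In this sketch the function names and the sample velocity are assumptions, and units with c = 1 are used by default.

```python
import math

def unit_velocity(v, c=1.0):
    """Components (u^1, u^2, u^3, u^4) of (112.4) for a spatial velocity v."""
    beta2 = sum(vi * vi for vi in v) / c**2
    root = math.sqrt(1.0 - beta2)
    return tuple(vi / (c * root) for vi in v) + (1.0 / root,)

def minkowski_norm_squared(u):
    """(u^4)^2 - (u^1)^2 - (u^2)^2 - (u^3)^2 in reduced Galilean coordinates."""
    return u[3]**2 - u[0]**2 - u[1]**2 - u[2]**2

u = unit_velocity((0.3, 0.4, 0.0))
print(u, minkowski_norm_squared(u))
print(unit_velocity((0.0, 0.0, 0.0)))  # a point at rest: (0, 0, 0, 1)
```

For a point at rest the unit velocity vector points along the time axis, a fact used again when the equations for a continuous medium are written at a point of rest.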

In Newtonian dynamics, the principle of inertia asserts that an 
isolated mass point is always in uniform rectilinear motion with 
respect to Galilean axes. In special relativity we preserve this principle 
and reformulate it in the following invariant manner: 

Principle of inertia: An isolated mass point follows a path which is a 
geodesic with positive $ds^2$ in Minkowski space-time.

Geodesics with $ds^2 = 0$ correspond to straight lines traversed with
velocity c, that is to the paths followed by light rays. We thus see that 
a simple relation exists between special relativity and the geometry 
of Minkowski space-time. 


113. The equations of motion. Consider the motion of a mass point P
under the action of a known force $\mathbf{f}$ which varies with the position and
velocity of P. According to Newtonian dynamics we have

$$ m\,\frac{d\mathbf{v}}{dt} = \mathbf{f}, \tag{113.1} $$

where m denotes the mass of the particle and $\mathbf{v}$ is its velocity vector in
space. Using (113.1) we obtain the energy equation

$$ \frac{d}{dt}\left(\tfrac{1}{2} m v^2\right) = \mathbf{f}\cdot\mathbf{v}. \tag{113.2} $$

The Newtonian equations (113.1) and (113.2) are not invariant 
under the Lorentz group which transforms from one Galilean 
coordinate system to another in relativity theory. They must there- 
fore be modified in order to comply with the demands of special 
relativity. In this process we shall be guided by the requirement that 
the new equations should reduce to the old ones for small velocities. 

Equations (112.5) represent the motion of an isolated mass
point in relativistic form. They suggest that the relativistic equations
of motion should be formulated in terms of $du^{\alpha}/ds$ rather than the
classical acceleration vector. In arbitrary curvilinear coordinates the
relativistic acceleration is given by the expression

$$ \frac{\nabla u^{\alpha}}{ds} \quad (\alpha = 1, 2, 3, 4). $$

Let us characterize the inertia of a point P by a parameter $m_0$,
having the dimensions of mass, which we shall call the rest-mass of P.
We postulate the following form of the equations of motion

$$ m_0 c^2\,\frac{\nabla u^{\alpha}}{ds} = \Phi^{\alpha}, \tag{113.3} $$

where $\Phi^{\alpha}$ is the relativistic generalization of the Newtonian force
vector. The coefficient $c^2$ has been introduced for reasons which will
appear shortly. The equations (113.3), which are the equations of
motion in special relativity, were first given by Minkowski.

Since $u^{\alpha}$ is a unit vector its absolute differential $\nabla u^{\alpha}$ is orthogonal
to it. Then according to (113.3)

$$ \Phi^{\alpha} u_{\alpha} = 0. \tag{113.4} $$

Thus the vector $\Phi^{\alpha}$ is always orthogonal to the unit velocity vector.

We now use Galilean coordinates and interpret equations (113.3)
by comparison with the Newtonian equations (113.1) and (113.2).
In such a coordinate system equation (113.3) can be written

$$ m_0 c^2\,\frac{du^{\alpha}}{ds} = \Phi^{\alpha}, $$

and reintroducing the variable t, related to s by (112.3), we have

$$ m_0 c\,\frac{du^{\alpha}}{dt} = \Phi^{\alpha}\sqrt{1 - \beta^2}. $$

Using (112.4) to evaluate the components $u^{\alpha}$ as functions of the
components of the spatial velocity $\mathbf{v}$ we find

$$ \frac{d}{dt}\left(\frac{m_0 v^i}{\sqrt{1 - \beta^2}}\right) = \Phi^i\sqrt{1 - \beta^2}, \tag{113.5} $$

$$ \frac{d}{dt}\left(\frac{m_0 c}{\sqrt{1 - \beta^2}}\right) = \Phi^4\sqrt{1 - \beta^2}, \tag{113.6} $$

which bear a simple relationship to the Newtonian equations.
Denoting the spatial vector whose components are

$$ f^i = \Phi^i\sqrt{1 - \beta^2}, \tag{113.7} $$

by $\mathbf{f}$, we may rewrite equations (113.5) in ordinary vector notation as

$$ \frac{d}{dt}\left(\frac{m_0 \mathbf{v}}{\sqrt{1 - \beta^2}}\right) = \mathbf{f}, \tag{113.8} $$

which recalls the fundamental equation of Newtonian dynamics
(113.1) and reduces to it when β becomes negligible compared to
unity.

It is a little more complicated to interpret the right-hand side of
(113.6). The orthogonality condition (113.4) may be written in
reduced Galilean coordinates as

$$ \Phi^4 u^4 = \sum_i \Phi^i u^i, $$

or using (112.4) and (113.7),

$$ \frac{\Phi^4}{\sqrt{1 - \beta^2}} = \sum_i \frac{f^i v^i}{c(1 - \beta^2)} = \frac{\mathbf{f}\cdot\mathbf{v}}{c(1 - \beta^2)}, $$

from which

$$ \Phi^4\sqrt{1 - \beta^2} = \frac{\mathbf{f}\cdot\mathbf{v}}{c}. $$

Then (113.6) can be written

$$ \frac{d}{dt}\left(\frac{m_0 c^2}{\sqrt{1 - \beta^2}}\right) = \mathbf{f}\cdot\mathbf{v}. \tag{113.9} $$

The right-hand side of this equation is identical with that of the
classical mechanical relation (113.2); so the time variation of the
quantity

$$ E = \frac{m_0 c^2}{\sqrt{1 - \beta^2}} \tag{113.10} $$

is equal to the work done by the force $\mathbf{f}$. We are therefore led to
suppose that (113.10) defines the total energy of the mass point. It is
important to note that this energy does not reduce to zero as v tends
to zero. We write

$$ E = E_0 + T, $$

where

$$ E_0 = m_0 c^2, \quad T = m_0 c^2\left(\frac{1}{\sqrt{1 - \beta^2}} - 1\right). \tag{113.11} $$

We shall call $E_0$ the rest energy of the mass point and T the relativistic
kinetic energy. For small values of v

$$ T \approx \tfrac{1}{2} m_0 c^2 \beta^2 = \tfrac{1}{2} m_0 v^2. $$

Hence, for small velocities, E differs only by the constant $E_0$ from the
ordinary kinetic energy of classical mechanics.
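
The decomposition (113.11) and its low-velocity limit can be illustrated numerically. The sketch below is only an illustration; the function names are assumptions, and the kinetic energy is computed from an algebraically equivalent form of (113.11) chosen to avoid loss of precision when v is much smaller than c.

```python
import math

def total_energy(m0, v, c):
    """E = m0 c^2 / sqrt(1 - beta^2), equation (113.10)."""
    beta2 = (v / c) ** 2
    return m0 * c**2 / math.sqrt(1.0 - beta2)

def kinetic_energy(m0, v, c):
    """T = E - E0 of (113.11), rewritten as m0 c^2 beta^2 / (root (1 + root))
    with root = sqrt(1 - beta^2), which is the same quantity."""
    beta2 = (v / c) ** 2
    root = math.sqrt(1.0 - beta2)
    return m0 * c**2 * beta2 / (root * (1.0 + root))

c = 299_792.458  # km/sec, approximate value assumed for the example
m0, v = 1.0, 30.0
print(kinetic_energy(m0, v, c), 0.5 * m0 * v**2)   # nearly equal for small v
print(total_energy(2.0, 0.6, 1.0), kinetic_energy(2.0, 0.6, 1.0))
```

At v = 30 km/sec the relativistic and classical kinetic energies differ by less than one part in a hundred million, while at v = 0.6c the kinetic energy is already a quarter of the rest energy.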


114. Momentum-energy vector and relativistic mass. The vector

$$ p^{\alpha} = m_0 c\,u^{\alpha}, \tag{114.1} $$

which is collinear with the unit velocity vector, is called the
momentum-energy vector. According to (112.4) the components of this
vector are given by

$$ p^i = \frac{m_0 v^i}{\sqrt{1 - \beta^2}}, \quad p^4 = \frac{m_0 c}{\sqrt{1 - \beta^2}} \tag{114.2} $$

in reduced Galilean coordinates.

Introducing the momentum vector $\mathbf{p}$ whose components are $p^i$, the
fundamental equation (113.8) may be written in the form

$$ \frac{d\mathbf{p}}{dt} = \mathbf{f}. \tag{114.3} $$

This can be obtained from the Newtonian equation of motion (113.1)
by replacing $m\mathbf{v}$ by $\mathbf{p}$. Thus the point P can be considered to have the
variable mass

$$ m = \frac{m_0}{\sqrt{1 - \beta^2}}. \tag{114.4} $$

This quantity m is known as the relativistic mass of P. For small
velocities m reduces to the rest mass $m_0$ and, as v approaches the
velocity of light, m tends to infinity.

According to (113.10) the relativistic mass is equal to the total
energy E divided by $c^2$ while the rest mass is the rest energy divided by
$c^2$. It thus appears that there is a direct correlation between mass and
energy in relativity theory which has no analogue in classical physics.
We now propose to investigate this relationship.
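
A short numerical sketch may help to fix the relations between the momentum-energy vector, the relativistic mass and the total energy. The function names and sample values are assumptions; the check that the Minkowski length of $p^{\alpha}$ equals $m_0 c$ is not stated in the text but follows from (114.1), since $u^{\alpha}$ is a unit vector.

```python
import math

def momentum_energy(m0, v, c=1.0):
    """Components (p^1, p^2, p^3, p^4) of (114.2) for a spatial velocity v."""
    beta2 = sum(vi * vi for vi in v) / c**2
    root = math.sqrt(1.0 - beta2)
    return tuple(m0 * vi / root for vi in v) + (m0 * c / root,)

def relativistic_mass(m0, speed, c=1.0):
    """m = m0 / sqrt(1 - beta^2), equation (114.4)."""
    return m0 / math.sqrt(1.0 - (speed / c) ** 2)

p = momentum_energy(2.0, (0.6, 0.0, 0.0))
m = relativistic_mass(2.0, 0.6)
print(p)                        # spatial part equals m v, time part equals m c
print(m, m * 0.6)
print(p[3]**2 - p[0]**2 - p[1]**2 - p[2]**2)   # equals (m0 c)^2
```

With c = 1 the fourth component is numerically equal to the total energy (113.10), which is the correlation between mass and energy referred to above.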


115. The inertia of energy. Consider a complex material particle
referred to a Galilean frame $S_0$ which is chosen in such a way that the
total momentum of the particle is zero. Suppose also that the particle
has the total energy $E_0$ relative to $S_0$; this is its rest energy.† The
momentum-energy vector of the particle then has the components

$$ p^i = 0, \quad p^4 = \frac{E_0}{c} $$

with respect to $S_0$.

† This includes the interaction energy of the components of the ‘particle’
as well as the energy which corresponds to their masses. (T.)

Now consider another Galilean frame S moving with velocity $\mathbf{v}$
relative to $S_0$. Carrying out a Lorentz transformation, we see that the
components of the momentum-energy vector with respect to S are
given by

$$ p^i = \frac{E_0}{c^2}\,\frac{v^i}{\sqrt{1 - \beta^2}}, \quad p^4 = \frac{E_0}{c}\,\frac{1}{\sqrt{1 - \beta^2}}. $$

It is clear from these relations that all the rest energy $E_0$ makes a
contribution to the spatial momentum vector as if it were a mass
$E_0/c^2$. We are thus led to formulate the following principle:

A system having any form of rest energy $E_0$ has a corresponding
inertial mass

$$ m_0 = \frac{E_0}{c^2}. \tag{115.2} $$

This statement, which is known as the principle of inertia of energy, is
due to Einstein. The recognition of the equivalence of mass and
energy is, without doubt, the most fertile contribution that the special
theory of relativity has made to physics.

As an example of this principle let us suppose that the particle,
referred to $S_0$, is not subject to any forces and emits an energy $\Delta E_0$ in
the form of electromagnetic radiation during a finite interval of time.
This might well be in the form of a spherical wave and the total
momentum with respect to $S_0$ of the energy radiated would then be
zero. It follows that the particle, initially at rest with respect to $S_0$,
will remain so after emitting the radiation.

Now refer the description to the S frame. Since the particle and the
radiation constitute an isolated system, there will be conservation of
the total momentum-energy vector. Hence, the radiation has the
momentum

$$ \frac{\Delta E_0}{c^2}\,\frac{\mathbf{v}}{\sqrt{1 - \beta^2}} $$

and the energy

$$ \frac{\Delta E_0}{\sqrt{1 - \beta^2}} $$

with respect to S. The particle has thus supplied this momentum and
energy without changing its velocity. This is only possible if its
rest-mass is changed. If $m_0$ denotes the rest-mass of the particle before the
emission and $m_0'$ that after emission then we have the conservation
relations

$$ \frac{m_0 \mathbf{v}}{\sqrt{1 - \beta^2}} = \frac{m_0' \mathbf{v}}{\sqrt{1 - \beta^2}} + \frac{\Delta E_0}{c^2}\,\frac{\mathbf{v}}{\sqrt{1 - \beta^2}}, $$

$$ \frac{m_0 c^2}{\sqrt{1 - \beta^2}} = \frac{m_0' c^2}{\sqrt{1 - \beta^2}} + \frac{\Delta E_0}{\sqrt{1 - \beta^2}}, $$

which are both equivalent to

$$ m_0' = m_0 - \frac{\Delta E_0}{c^2}. $$

The mass of the particle has thus been decreased by an amount equal
to the energy emitted divided by $c^2$.

In Newtonian mechanics there are separate conservation laws for
the mass and energy of an isolated system. In relativistic mechanics,
however, there is a single law of conservation for the total energy of an
isolated system and the rest-masses of the constituents will alter every
time kinetic energy is transformed into other forms of energy, and
vice-versa. The rest-mass of a material particle remains constant as
long as its rest-energy is not altered by such changes. But the
rest-masses can change appreciably if the interaction energies are of the
same order of magnitude as the rest energies. In this way it is possible
to explain the mass defects observed in atomic nuclei and, in a general
way, the mass losses which occur in most nuclear reactions.

The inertia of energy plays an important part in the understanding
of nuclear phenomena, which are the object of so much research in 
contemporary physics. In these, energy is both materialized in the 
form of elementary particles and liberated by the disintegration of 
matter. 
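
The emission example of this section can be followed through with numbers. The sketch below assumes units in which c = 1 and uses hypothetical values for the rest-mass, the emitted energy and the velocity; the function names are illustrative only.

```python
import math

def rest_mass_after_emission(m0, delta_e0, c=1.0):
    """Rest-mass remaining after an energy delta_e0 has been radiated in the
    rest frame: m0' = m0 - delta_e0 / c^2."""
    return m0 - delta_e0 / c**2

def momentum_and_energy(m0, v, c=1.0):
    """Momentum and energy, in a frame where the speed is v, of a body of
    rest-mass m0."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return m0 * v * g, m0 * c**2 * g

m0, delta_e0, v = 2.0, 0.5, 0.6
m0_after = rest_mass_after_emission(m0, delta_e0)

p_before, e_before = momentum_and_energy(m0, v)
p_after, e_after = momentum_and_energy(m0_after, v)
# The radiation carries momentum and energy as if it were a mass delta_e0 / c^2.
p_rad, e_rad = momentum_and_energy(delta_e0, v)

print(m0_after)
print(p_before, p_after + p_rad)
print(e_before, e_after + e_rad)
```

Both conservation relations are satisfied with the same reduced rest-mass, which is the content of the principle of inertia of energy in this example.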


IV. RELATIVISTIC DYNAMICS OF 
CONTINUOUS MEDIA 


116. Equations for a system at rest. In order to obtain the relativistic form of the equations of motion for a continuous medium, we introduce an orthogonal Galilean frame $S_0$ which is at rest with respect to some point $P_0$ of the medium. We then write down the equations at $P_0$ in terms of $S_0$.

At $P_0$ all the components $v^i$ of the spatial velocity vector $\mathbf{v}$ are zero; on the other hand the derivatives of these components are generally different from zero. Denoting the unit velocity vector associated with $v^i$ by $u^\lambda$, it is clear that

$$u^i = 0, \qquad u^4 = 1 \tag{116.1}$$

at $P_0$. It follows, using (112.4), that the derivatives of the $u^\lambda$ are given by

$$\partial_\lambda u^i = \frac{1}{c}\,\partial_\lambda v^i, \qquad \partial_\lambda u^4 = 0, \tag{116.2}$$

since $\beta = 0$ at $P_0$ and

$$u^\lambda\,\partial_\mu u_\lambda = 0,$$

because $u^\lambda$ is a unit vector.

The non-relativistic equations (99.2) and (103.2) take the form

$$\frac{\partial p^i}{\partial t} + \partial_k t^{ik} = f^i, \qquad \frac{\partial \rho}{\partial t} + \partial_k p^k = 0 \tag{116.3}$$

in the system $S_0$ and at the point $P_0$, where $p^i$ denotes the ordinary momentum vector per unit volume.

Let us consider equations (116.3) from the point of view of mass-energy equivalence. The ordinary momentum vector $p^i = \rho v^i$ involves only the energy corresponding to the matter density $\rho$. However we know that in relativistic dynamics each form of energy must make its contribution to the momentum vector. Here we shall consider, in addition, only that energy which derives from the action of mechanical stresses, avoiding, in particular, the case where an electromagnetic interaction also occurs.

In order to evaluate the corresponding flux of energy, consider an element of area $dS$ whose 'components' are denoted by the $d\sigma_k$, defined by (101.2). From (101.3) there is a corresponding surface force

$$T^i\,dS = t^{ik}\,d\sigma_k.$$

If the matter in the neighbourhood of this element of area moves with the velocity $\mathbf{v}$, having contravariant components $v^l$, the corresponding work done is given by

$$\sum_l v^l\,t^{kl}\,d\sigma_k.$$

Introducing the covariant components of the velocity by means of the spatial metric $-(dx^1)^2 - (dx^2)^2 - (dx^3)^2$ this becomes

$$-v_l\,t^{kl}\,d\sigma_k.$$

In other words, a flux of energy traverses the element of area $dS$ and the vector defining this total energy flux has the components

$$\rho c^2 v^i - v_l\,t^{il},$$

where the first term corresponds to the density of matter $\rho$. The relativistic relation between energy and mass [see (115.1)] implies that the momentum vector per unit volume is the vector given above divided by $c^2$:

$$p^i = \rho v^i - \frac{1}{c^2}\,v_l\,t^{il}. \tag{116.4}$$
The components of this vector are zero at $P_0$, but this is not true of their derivatives which alone are involved in the equations of motion. In order to obtain the relativistic equations of motion valid at $P_0$ in terms of the frame $S_0$ we substitute (116.4) for the vector $p^i$ in the classical relations (116.3). We thus have

$$\frac{\partial}{\partial t}\left(\rho v^i - \frac{1}{c^2}\,v_l\,t^{il}\right) + \partial_k t^{ik} = f^i,$$

$$\frac{\partial \rho}{\partial t} + \partial_k\left(\rho v^k - \frac{1}{c^2}\,v_l\,t^{kl}\right) = 0,$$

and allowing for the fact that the $v^i$ vanish at $P_0$,

$$\rho\,\frac{\partial v^i}{\partial t} - \frac{1}{c^2}\,\frac{\partial v_l}{\partial t}\,t^{il} + \partial_k t^{ik} = f^i,$$

$$\frac{\partial \rho}{\partial t} + \rho\,\partial_k v^k - \frac{1}{c^2}\,\partial_k v_l\,t^{kl} = 0.$$

Let us now replace the derivatives with respect to $t$ by those with respect to $x^4 = ct$. We obtain, with the usual notation for partial derivatives,

$$\begin{aligned}
c\rho\,\partial_4 v^i - \frac{1}{c}\,\partial_4 v_l\,t^{il} + \partial_k t^{ik} &= f^i,\\
c\,\partial_4\rho + \rho\,\partial_k v^k - \frac{1}{c^2}\,\partial_k v_l\,t^{kl} &= 0,
\end{aligned} \tag{116.5}$$

which are the required equations of motion.


117. The equations of motion in tensor form. Starting from equations (116.5) which are valid for the point $P_0$ with respect to the frame $S_0$ we can determine the general tensor form of the equations of motion, thus translating the classical equations (99.2) and (103.2) into relativistic form.

The appearance of the tensor $t^{ik}$ in the non-relativistic equations (103.2) suggests that a corresponding symmetrical second-order tensor must play an important role in the desired relativistic equations. We therefore introduce the tensor $T^{\lambda\mu}$ which has the components

$$T^{ik} = t^{ik}, \qquad T^{i4} = T^{44} = 0,$$

at the point $P_0$ in the frame $S_0$, and which consequently satisfies the tensor equation

$$T^{\lambda\mu}\,u_\mu = 0. \tag{117.1}$$

We also introduce the vector $\Phi^\lambda$ which has the components

$$\Phi^i = f^i, \qquad \Phi^4 = 0,$$

at the point $P_0$ in the frame $S_0$, and which is therefore orthogonal to the unit velocity vector:

$$\Phi^\lambda\,u_\lambda = 0. \tag{117.2}$$

We now verify that equations (116.5) can be put into the following tensor form which is valid in an arbitrary curvilinear coordinate system:

$$\nabla_\mu\left(c^2\rho\,u^\lambda u^\mu + T^{\lambda\mu}\right) = \Phi^\lambda. \tag{117.3}$$


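In fact, using the Galilean coordinate system $S_0$ at the point $P_0$, equations (117.3) reduce to

$$c^2\rho\,\partial_4 u^i + \partial_k T^{ik} + \partial_4 T^{i4} = f^i \qquad (\lambda = i), \tag{117.4}$$

$$c^2\,\partial_4\rho + c^2\rho\,\partial_k u^k + \partial_k T^{4k} + \partial_4 T^{44} = 0 \qquad (\lambda = 4), \tag{117.5}$$

when account is taken of (116.1) and (116.2). It is easy to evaluate the terms $\partial_\mu T^{\lambda 4}$ appearing in (117.4) and (117.5). We have at $P_0$

$$\partial_\mu T^{\lambda 4} = u_\nu\,\partial_\mu T^{\lambda\nu}.$$

Then using (117.1) we have

$$u_\nu\,\partial_\mu T^{\lambda\nu} = -\partial_\mu u_\nu\,T^{\lambda\nu} = -\partial_\mu u_l\,T^{\lambda l},$$

so that

$$\partial_\mu T^{\lambda 4} = -\partial_\mu u_l\,T^{\lambda l}.$$

Substituting this expression for $\partial_\mu T^{\lambda 4}$ into (117.4) and (117.5) we obtain

$$\begin{aligned}
c^2\rho\,\partial_4 u^i + \partial_k t^{ik} - \partial_4 u_l\,t^{il} &= f^i,\\
c^2\,\partial_4\rho + c^2\rho\,\partial_k u^k - \partial_k u_l\,t^{kl} &= 0.
\end{aligned} \tag{117.6}$$
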
Then using equations (116.2) we recover (116.5). Equations (117.3) are the fundamental equations of relativistic dynamics for continuous media. $T^{\lambda\mu}$ is called the relativistic stress tensor.

118. The momentum-energy tensor. The form of the fundamental equations (117.3) leads us to introduce the symmetric tensor

$$P^{\lambda\mu} = \rho\,u^\lambda u^\mu + \frac{1}{c^2}\,T^{\lambda\mu}, \tag{118.1}$$

so that they may be rewritten as

$$\nabla_\mu P^{\lambda\mu} = \frac{1}{c^2}\,\Phi^\lambda. \tag{118.2}$$

$P^{\lambda\mu}$ is called the momentum-energy tensor of the continuous medium. We note that, because of (117.1),

$$P^{\lambda\mu}\,u_\mu = \rho\,u^\lambda.$$

This result can be interpreted by saying that the linear transformation defined by the momentum-energy tensor has the unit velocity vector as an eigenvector, the corresponding eigenvalue being $\rho$.

Let us now find the momentum-energy tensor for the case in which the continuous medium is a perfect fluid of pressure $p$. Using the notation of (111.3), $T^{\lambda\mu}$ has the components

$$T^{ik} = t^{ik} = -p\,\eta^{ik}, \qquad T^{i4} = T^{44} = 0$$

at the point $P_0$ in the frame $S_0$. In arbitrary coordinates, since $T^{\lambda\mu}$ satisfies (117.1), these take the tensor form

$$T^{\lambda\mu} = -p\,(g^{\lambda\mu} - u^\lambda u^\mu).$$

Hence

$$P^{\lambda\mu} = \left(\rho + \frac{p}{c^2}\right) u^\lambda u^\mu - \frac{p}{c^2}\,g^{\lambda\mu}. \tag{118.3}$$
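The eigenvector property of the momentum-energy tensor can be checked numerically for the perfect fluid. The sketch below is an illustration only, under these assumptions: reduced Galilean coordinates with the diagonal metric (-1, -1, -1, +1), list index 3 standing for the coordinate x^4 = ct, the tensor written as P = (rho + p/c^2) u u - (p/c^2) g, and hypothetical function names and sample values.

```python
import math

ETA = [-1.0, -1.0, -1.0, 1.0]  # diagonal metric; list index 3 stands for x^4 = ct

def unit_velocity(v, c):
    """Unit velocity vector u^lambda for the spatial velocity v = (vx, vy, vz)."""
    beta2 = sum(x * x for x in v) / c**2
    gamma = 1.0 / math.sqrt(1.0 - beta2)
    return [gamma * v[0] / c, gamma * v[1] / c, gamma * v[2] / c, gamma]

def perfect_fluid_tensor(rho, p, u, c):
    """P^{lambda mu} = (rho + p/c^2) u^lambda u^mu - (p/c^2) g^{lambda mu}."""
    return [[(rho + p / c**2) * u[a] * u[b] - (p / c**2) * (ETA[a] if a == b else 0.0)
             for b in range(4)] for a in range(4)]

def contract_with_u(P, u):
    """P^{lambda mu} u_mu, lowering the index of u with the diagonal metric."""
    return [sum(P[a][b] * ETA[b] * u[b] for b in range(4)) for a in range(4)]

# Hypothetical sample values in units where c = 1.
u = unit_velocity((0.3, 0.2, 0.1), 1.0)
P = perfect_fluid_tensor(2.0, 0.5, u, 1.0)
Pu = contract_with_u(P, u)  # each component should equal 2.0 * u[a]
```

The contraction returns rho times u for any admissible velocity, because u is a unit vector and the pressure terms cancel.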


V. THE MAXWELL-LORENTZ EQUATIONS 


119. The electromagnetic field tensor. Maxwell's theory of the electromagnetic field in a vacuum and the Maxwell-Lorentz electron theory involve an electric field vector $\mathbf{E}$ and a magnetic field vector $\mathbf{H}$ which both vary with time. This representation is only useful, however, when considering transformations which consist of spatial rotations and translations and changes in the time origin. We shall now show how the Maxwell-Lorentz equations which govern the electromagnetic field may be expressed in a particularly simple way by using tensors in $V_4$. An important feature of this tensor representation is that it has led to a better understanding of the electrodynamics of moving bodies.

Consider a particular reduced Galilean coordinate system $(x^1, x^2, x^3, x^4)$ which corresponds to a rectangular coordinate system $Oxyz$ in space. It is well known that the electric and magnetic fields can be defined in such a system by means of the vector potential $\mathbf{A}$ and the scalar potential $\Phi$ through the relations

$$\mathbf{H} = \operatorname{curl}\mathbf{A}, \qquad \mathbf{E} = -\operatorname{grad}\Phi - \frac{1}{c}\,\frac{\partial\mathbf{A}}{\partial t}. \tag{119.1}$$


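Let $A_x, A_y, A_z$ represent the components of $\mathbf{A}$ along the axes $Oxyz$ and consider the four-vector $\phi$ which has the contravariant components

$$\phi^1 = -A_x, \qquad \phi^2 = -A_y, \qquad \phi^3 = -A_z, \qquad \phi^4 = -\Phi, \tag{119.2}$$

in the coordinate system $(x^1, x^2, x^3, x^4)$ of $V_4$. It therefore has the covariant components

$$\phi_1 = A_x, \qquad \phi_2 = A_y, \qquad \phi_3 = A_z, \qquad \phi_4 = -\Phi.$$

The relations (119.1) can then be written in the form

$$\begin{aligned}
H_x &= \partial_2\phi_3 - \partial_3\phi_2, & E_x &= \partial_1\phi_4 - \partial_4\phi_1,\\
H_y &= \partial_3\phi_1 - \partial_1\phi_3, & E_y &= \partial_2\phi_4 - \partial_4\phi_2,\\
H_z &= \partial_1\phi_2 - \partial_2\phi_1, & E_z &= \partial_3\phi_4 - \partial_4\phi_3.
\end{aligned} \tag{119.3}$$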

In other words, the six components of the vectors $\mathbf{H}$ and $\mathbf{E}$ in space are given by the six covariant components of the antisymmetric tensor

$$F_{\lambda\mu} = \partial_\lambda\phi_\mu - \partial_\mu\phi_\lambda \tag{119.4}$$

in $V_4$. In reduced Galilean coordinates we have explicitly

$$\begin{aligned}
H_x &= F_{23}, & E_x &= F_{14},\\
H_y &= F_{31}, & E_y &= F_{24},\\
H_z &= F_{12}, & E_z &= F_{34}.
\end{aligned} \tag{119.5}$$

The components of $\mathbf{H}$ and $\mathbf{E}$ can also be expressed using the contravariant components of the tensor $F_{\lambda\mu}$ as

$$\begin{aligned}
H_x &= F^{23}, & E_x &= -F^{14},\\
H_y &= F^{31}, & E_y &= -F^{24},\\
H_z &= F^{12}, & E_z &= -F^{34}.
\end{aligned} \tag{119.6}$$

Thus in space-time the electromagnetic field is defined by an antisymmetric second-order tensor known as the electromagnetic field tensor. $F_{\lambda\mu}$ is the curl of $\phi_\lambda$, which is called the four-vector potential. The introduction of the electromagnetic field tensor defined by (119.4) illustrates the fundamental unity of the electromagnetic field and removes the apparent independence of the electric and magnetic fields since, according to the tensor transformation law, the components of the electric field vector referred to one Galilean frame depend on both the electric and magnetic field components in any other frame.

Consider, for example, two systems of Galilean coordinates $(x, y, z, t)$ and $(x', y', z', t')$. It follows from the equations (108.12) defining the Lorentz transformation and from (119.5) that the components of the electric and magnetic fields in the second system are given, in terms of the components in the first system, by

$$\begin{aligned}
H'_x &= H_x, & E'_x &= E_x,\\
H'_y &= \frac{H_y + \beta E_z}{\sqrt{1-\beta^2}}, & E'_y &= \frac{E_y - \beta H_z}{\sqrt{1-\beta^2}},\\
H'_z &= \frac{H_z - \beta E_y}{\sqrt{1-\beta^2}}, & E'_z &= \frac{E_z + \beta H_y}{\sqrt{1-\beta^2}}.
\end{aligned}$$
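The mixing of electric and magnetic components can be illustrated numerically. The sketch below assumes the standard transformation for relative motion along the x axis with velocity beta times c, in the Gaussian units used in the text. As a consistency check that is not stated in the text, it uses the well-known fact that the scalar product of E and H and the difference H^2 - E^2 are unchanged by the transformation. The function names and sample values are hypothetical.

```python
import math

def transform_fields(E, H, beta):
    """Field components in the primed frame for relative motion along the x axis."""
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    Ex, Ey, Ez = E
    Hx, Hy, Hz = H
    E_new = (Ex, gamma * (Ey - beta * Hz), gamma * (Ez + beta * Hy))
    H_new = (Hx, gamma * (Hy + beta * Ez), gamma * (Hz - beta * Ey))
    return E_new, H_new

def invariants(E, H):
    """Return the pair (E.H, H^2 - E^2)."""
    dot = sum(e * h for e, h in zip(E, H))
    diff = sum(h * h for h in H) - sum(e * e for e in E)
    return dot, diff

# Hypothetical sample fields.
E, H = (1.0, 2.0, 3.0), (-1.0, 0.5, 2.0)
E_new, H_new = transform_fields(E, H, 0.6)
before, after = invariants(E, H), invariants(E_new, H_new)
```

A purely electric field in one frame acquires magnetic components in the other, while both invariants keep their values.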


The tensor form, (119.4), of equations (119.1) permits a natural 
definition of the electromagnetic field in any system of space-time 
coordinates whether rectilinear or curvilinear. 

It will be seen that the four-vector potential $\phi_\lambda$ is not uniquely determined by the electromagnetic field. The gradient of an arbitrary scalar field $\psi$ can be added to any four-vector potential satisfying (119.4) without modifying these equations. Thus $\phi_\lambda$ may be replaced by

$$\phi'_\lambda = \phi_\lambda + \partial_\lambda\psi. \tag{119.7}$$

This procedure was called a gauge transformation by Hermann Weyl. Quantities, such as the electromagnetic field, which are left unchanged by a gauge transformation are said to be gauge invariant.


120. The adjoint electromagnetic field tensor. We saw in §52 that an arbitrary antisymmetric tensor can be uniquely associated with another antisymmetric tensor called its adjoint. The adjoint electromagnetic field tensor is of order $4 - 2 = 2$, and is defined by

$${}^{*}F^{\nu\rho} = \tfrac{1}{2}\,\varepsilon^{\lambda\mu\nu\rho}\,F_{\lambda\mu}, \tag{120.1}$$

in any coordinate system.

Let space-time be referred to reduced Galilean coordinates. Then $g = -1$ and

$$\varepsilon^{\lambda\mu\nu\rho} = \epsilon^{\lambda\mu\nu\rho},$$

where $\epsilon^{\lambda\mu\nu\rho}$ denotes the usual permutation indicator. In this way we get a simple physical interpretation of the components of the adjoint tensor in Galilean coordinates

$$\begin{aligned}
{}^{*}F^{23} &= F_{14} = E_x, & {}^{*}F^{14} &= F_{23} = H_x,\\
{}^{*}F^{31} &= F_{24} = E_y, & {}^{*}F^{24} &= F_{31} = H_y,\\
{}^{*}F^{12} &= F_{34} = E_z, & {}^{*}F^{34} &= F_{12} = H_z.
\end{aligned} \tag{120.2}$$

The adjoint tensor ${}^{*}F^{\lambda\mu}$ is therefore an alternative representation of the electromagnetic field.
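The exchange of the roles of E and H under the adjoint operation can be verified by direct summation over the permutation indicator. The sketch below is an illustration under stated assumptions: reduced Galilean coordinates, the indicator normalized so that its 1234 component is +1, the adjoint taken as one half of the indicator contracted with the covariant field tensor on its first two indices, list indices 0 to 3 standing for the coordinate labels 1 to 4, and hypothetical function names.

```python
from itertools import permutations

def permutation_sign(p):
    """Sign of a permutation of (0, 1, 2, 3), found by counting inversions."""
    inversions = sum(1 for i in range(len(p)) for j in range(i + 1, len(p)) if p[i] > p[j])
    return -1 if inversions % 2 else 1

def covariant_field_tensor(E, H):
    """F_{lambda mu} with H_x = F_23, H_y = F_31, H_z = F_12, E_x = F_14, E_y = F_24, E_z = F_34."""
    Ex, Ey, Ez = E
    Hx, Hy, Hz = H
    entries = {(1, 2): Hx, (2, 0): Hy, (0, 1): Hz,
               (0, 3): Ex, (1, 3): Ey, (2, 3): Ez}
    F = [[0.0] * 4 for _ in range(4)]
    for (a, b), value in entries.items():
        F[a][b] = value
        F[b][a] = -value
    return F

def adjoint(F):
    """Adjoint tensor: half the sum over lambda, mu of indicator(lambda, mu, nu, rho) * F[lambda][mu]."""
    A = [[0.0] * 4 for _ in range(4)]
    for l, m, n, r in permutations(range(4)):
        A[n][r] += 0.5 * permutation_sign((l, m, n, r)) * F[l][m]
    return A

# Hypothetical sample fields.
A = adjoint(covariant_field_tensor((1.0, 2.0, 3.0), (4.0, 5.0, 6.0)))
```

With these conventions the 23, 31, 12 components of the adjoint carry E and the 14, 24, 34 components carry H, the reverse of the arrangement in the field tensor itself.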


121. The electric current vector. The Maxwell-Lorentz theory of electrons, expressed in terms of a Galilean frame, is formulated in terms of the charge density $\rho$ and the product of $\rho$ with the velocity of the charge $\mathbf{v}$, i.e. $\rho\mathbf{v}$. Suppose the charge to be at rest with respect to a Galilean frame $S_0$ and let $\rho_0$ be the corresponding charge density which obviously defines a scalar in $V_4$.

The quantity of electricity $de$ occupying the volume element $dV_0$ is given by

$$de = \rho_0\,dV_0.$$

This quantity of electricity must be invariant in $V_4$. The total charge on an electron, for instance, is the same with respect to any Galilean frame. Thus, for an arbitrary Galilean frame $S$, we have

$$de = \rho\,dV,$$

where $dV$ denotes the volume element in $S$ which corresponds to $dV_0$ in $S_0$. It follows immediately from the equations of the Lorentz transformation (108.11) that, if $S_0$ is moving with respect to $S$ with velocity $v$, we have

$$dV = \sqrt{1-\beta^2}\;dV_0.$$

Hence

$$\rho\,\sqrt{1-\beta^2}\;dV_0 = \rho_0\,dV_0,$$

and $\rho$ is related to $\rho_0$ by

$$\rho = \frac{\rho_0}{\sqrt{1-\beta^2}}. \tag{121.1}$$

The four-vector $J^\lambda$ defined by the relation

$$J^\lambda = \rho_0\,c\,u^\lambda \tag{121.2}$$

is called the electric current vector. Substituting the values (112.4) for $u^\lambda$, we find that

$$J^i = \frac{\rho_0\,v^i}{\sqrt{1-\beta^2}}, \qquad J^4 = \frac{\rho_0\,c}{\sqrt{1-\beta^2}},$$

hence, using (121.1),

$$J^i = \rho\,v^i, \qquad J^4 = \rho\,c. \tag{121.3}$$


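122. The first set of Maxwell-Lorentz equations. The Maxwell-Lorentz equations can be split into two sets. The first of these is given in ordinary three-vector notation by

$$\operatorname{curl}\mathbf{H} - \frac{1}{c}\,\frac{\partial\mathbf{E}}{\partial t} = 4\pi\,\frac{\rho\mathbf{v}}{c}, \tag{122.1}$$

$$\operatorname{div}\mathbf{E} = 4\pi\rho. \tag{122.2}$$

The x-component of (122.1) is given by

$$\frac{\partial H_z}{\partial y} - \frac{\partial H_y}{\partial z} - \frac{1}{c}\,\frac{\partial E_x}{\partial t} = 4\pi\,\frac{\rho v^1}{c},$$

and using (119.6) and (121.3) this may be rewritten

$$\partial_2 F^{12} + \partial_3 F^{13} + \partial_4 F^{14} = \frac{4\pi}{c}\,J^1.$$

We can therefore express (122.1) in reduced Galilean coordinates as

$$\partial_\mu F^{i\mu} = \frac{4\pi}{c}\,J^i. \tag{122.3}$$

Equation (122.2) can be written explicitly as

$$\frac{\partial E_x}{\partial x} + \frac{\partial E_y}{\partial y} + \frac{\partial E_z}{\partial z} = 4\pi\rho,$$

or using (119.6) and (121.3), as

$$\partial_1 F^{41} + \partial_2 F^{42} + \partial_3 F^{43} = \frac{4\pi}{c}\,J^4.$$

It follows that the first set of Maxwell-Lorentz equations can, using reduced Galilean coordinates, be written in the very simple form

$$\partial_\mu F^{\lambda\mu} = \frac{4\pi}{c}\,J^\lambda. \tag{122.4}$$

Since the coordinate system is rectilinear this is equivalent to

$$\nabla_\mu F^{\lambda\mu} = \frac{4\pi}{c}\,J^\lambda. \tag{122.5}$$

The tensor form (122.5) of Maxwell's equations (122.1) and (122.2) is also valid for any system of curvilinear coordinates.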


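123. The second set of Maxwell-Lorentz equations. Using Galilean coordinates, the other set of Maxwell-Lorentz equations are given, in three-vector notation, by

$$\operatorname{curl}\mathbf{E} + \frac{1}{c}\,\frac{\partial\mathbf{H}}{\partial t} = 0, \tag{123.1}$$

$$\operatorname{div}\mathbf{H} = 0. \tag{123.2}$$

The x-component of (123.1) gives

$$\frac{\partial E_z}{\partial y} - \frac{\partial E_y}{\partial z} + \frac{1}{c}\,\frac{\partial H_x}{\partial t} = 0,$$

or, using (120.2),

$$\partial_2\,{}^{*}F^{12} + \partial_3\,{}^{*}F^{13} + \partial_4\,{}^{*}F^{14} = 0.$$

Thus the vector relation (123.1) can be translated into reduced Galilean coordinates as

$$\partial_\mu\,{}^{*}F^{i\mu} = 0. \tag{123.3}$$

Similarly (123.2) can be written

$$\frac{\partial H_x}{\partial x} + \frac{\partial H_y}{\partial y} + \frac{\partial H_z}{\partial z} = 0,$$

and, again using (120.2), this becomes

$$\partial_1\,{}^{*}F^{41} + \partial_2\,{}^{*}F^{42} + \partial_3\,{}^{*}F^{43} = 0.$$

It follows that the second set of Maxwell-Lorentz equations can be rewritten using reduced Galilean coordinates as

$$\partial_\mu\,{}^{*}F^{\lambda\mu} = 0. \tag{123.4}$$

Since the coordinate system used is rectilinear this is equivalent to

$$\nabla_\mu\,{}^{*}F^{\lambda\mu} = 0. \tag{123.5}$$

This tensor form is again valid for any system of coordinates. The similarity between (122.5) and (123.5) will be noted.

Equations (123.1) and (123.2) are simple consequences of (119.1); they merely express the fact that $\mathbf{E}$ and $\mathbf{H}$ are derivable from a vector potential $\mathbf{A}$ and a scalar potential $\Phi$. Equations (123.5) can be written

$$\nabla_\mu\left(\varepsilon^{\lambda\mu\nu\rho}\,F_{\nu\rho}\right) = 0. \tag{123.6}$$

However, remembering (69.3), we can easily verify that

$$\nabla_\mu\,\varepsilon^{\lambda\mu\nu\rho} = 0.$$

Then equations (123.6) take the form

$$\varepsilon^{\lambda\mu\nu\rho}\,\nabla_\mu F_{\nu\rho} = 0$$

and, on multiplying by $\sqrt{|g|}$, they become

$$\epsilon^{\lambda\mu\nu\rho}\,\nabla_\mu F_{\nu\rho} = 0.$$

The covariant derivative $\nabla_\mu F_{\nu\rho}$ may be written

$$\nabla_\mu F_{\nu\rho} = \partial_\mu F_{\nu\rho} - \Gamma^{\sigma}_{\mu\nu}\,F_{\sigma\rho} - \Gamma^{\sigma}_{\mu\rho}\,F_{\nu\sigma}.$$

From considerations of symmetry

$$\epsilon^{\lambda\mu\nu\rho}\,\Gamma^{\sigma}_{\mu\nu} = 0, \qquad \epsilon^{\lambda\mu\nu\rho}\,\Gamma^{\sigma}_{\mu\rho} = 0,$$

hence, using an arbitrary system of coordinates, Maxwell's equations (123.5) may be written

$$\epsilon^{\lambda\mu\nu\rho}\,\partial_\mu F_{\nu\rho} = 0. \tag{123.7}$$

These equations have the explicit form

$$\partial_\mu F_{\nu\rho} + \partial_\nu F_{\rho\mu} + \partial_\rho F_{\mu\nu} = 0, \tag{123.8}$$

where $\mu$, $\nu$, $\rho$ denote three distinct indices. They are the necessary conditions for the existence of a four-vector $\phi_\lambda$ such that $F_{\lambda\mu}$ is its curl.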


124. Charge conservation. Taking the divergence of (122.1), differentiating (122.2) with respect to $t$ and combining the equations thus obtained leads to the following expression for the law of charge conservation:

$$\frac{\partial\rho}{\partial t} + \operatorname{div}(\rho\mathbf{v}) = 0. \tag{124.1}$$

It is easy to reduce this law to its general tensor form. Consider first a system of reduced Galilean coordinates. The first set of Maxwell-Lorentz equations can, according to (122.4), be written as

$$\partial_\mu F^{\lambda\mu} = \frac{4\pi}{c}\,J^\lambda. \tag{124.2}$$

Differentiating with respect to $x^\lambda$ (and carrying out the appropriate summation) we obtain

$$\partial_\lambda\partial_\mu F^{\lambda\mu} = \frac{4\pi}{c}\,\partial_\lambda J^\lambda.$$

However, taking account of the antisymmetry of $F^{\lambda\mu}$ and interchanging the repeated indices $\lambda$ and $\mu$, it follows that

$$\partial_\lambda\partial_\mu F^{\lambda\mu} = -\partial_\lambda\partial_\mu F^{\mu\lambda} = -\partial_\mu\partial_\lambda F^{\lambda\mu} = -\partial_\lambda\partial_\mu F^{\lambda\mu} = 0.$$

Hence, in reduced Galilean coordinates,

$$\partial_\lambda J^\lambda = 0, \tag{124.3}$$

which is an alternative form of (124.1). Referring space-time to an arbitrary curvilinear coordinate system, (124.3) becomes

$$\nabla_\lambda J^\lambda = 0. \tag{124.4}$$

This is the general tensor form of the law of charge conservation.


125. The Lorentz force density. Returning to Galilean coordinates, let $\mathbf{K}$ be the force density given by the Maxwell-Lorentz theory; that is the force exerted on the charge contained in unit volume. This is given in three-vector notation by the well-known formula

$$\mathbf{K} = \rho\,\mathbf{E} + \frac{\rho}{c}\,\mathbf{v}\wedge\mathbf{H}. \tag{125.1}$$

The x-component of (125.1) is

$$K_x = \rho E_x + \frac{\rho v_y}{c}\,H_z - \frac{\rho v_z}{c}\,H_y.$$

Expressing this relation in terms of the contravariant components of the electromagnetic field tensor and the covariant components of the current vector we get

$$K_x = \frac{1}{c}\left[J_4 F^{41} + J_2 F^{21} - J_3 F^{13}\right] = \frac{1}{c}\,J_\mu F^{\mu 1}.$$

The components $K^i$ of the space vector $\mathbf{K}$ are therefore given by

$$K^i = \frac{1}{c}\,J_\mu F^{\mu i}.$$

A fourth component $K^4$ may be introduced to complete the set $K^1, K^2, K^3$. We then write

$$K^\lambda = \frac{1}{c}\,J_\mu F^{\mu\lambda}, \tag{125.2}$$

this tensor form being valid in an arbitrary coordinate system.

In terms of reduced Galilean coordinates the fourth component $K^4$ becomes

$$K^4 = \frac{1}{c}\left[J_1 F^{14} + J_2 F^{24} + J_3 F^{34}\right].$$

According to (125.1) this may be written in three-vector notation as

$$K^4 = \frac{1}{c}\,\rho\,\mathbf{v}\cdot\mathbf{E} = \frac{1}{c}\,\mathbf{K}\cdot\mathbf{v}.$$

Apart from the factor $1/c$, $K^4$ thus represents the work per unit time and volume which corresponds to the force density $\mathbf{K}$.

The four-vector defined by (125.2) is known as the Lorentz force density. Since $F^{\lambda\mu}$ is antisymmetric

$$K^\lambda J_\lambda = \frac{1}{c}\,F^{\mu\lambda} J_\mu J_\lambda = 0,$$

and we see that the force and current four-vectors are orthogonal.
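The relation between the four-vector force density and the three-vector formula can be checked numerically. The sketch below is an illustration under stated assumptions: reduced Galilean coordinates with diagonal metric (-1, -1, -1, +1), list index 3 standing for the coordinate x^4, the force density taken as (1/c) times the covariant current contracted with the first index of the contravariant field tensor, and hypothetical function names and sample values.

```python
ETA = [-1.0, -1.0, -1.0, 1.0]  # diagonal metric; list index 3 stands for x^4 = ct

def contravariant_field_tensor(E, H):
    """F^{lambda mu} with H_x = F^23, H_y = F^31, H_z = F^12 and E_x = -F^14, etc."""
    Ex, Ey, Ez = E
    Hx, Hy, Hz = H
    entries = {(1, 2): Hx, (2, 0): Hy, (0, 1): Hz,
               (0, 3): -Ex, (1, 3): -Ey, (2, 3): -Ez}
    F = [[0.0] * 4 for _ in range(4)]
    for (a, b), value in entries.items():
        F[a][b] = value
        F[b][a] = -value
    return F

def current_vector(rho, v, c):
    """J^i = rho v^i, J^4 = rho c."""
    return [rho * v[0], rho * v[1], rho * v[2], rho * c]

def lorentz_force_density(rho, v, E, H, c):
    """K^lambda = (1/c) J_mu F^{mu lambda}, with J_mu obtained by lowering with ETA."""
    F = contravariant_field_tensor(E, H)
    J = current_vector(rho, v, c)
    return [sum(ETA[m] * J[m] * F[m][l] for m in range(4)) / c for l in range(4)]

# Hypothetical sample values.
K = lorentz_force_density(2.0, (0.3, -0.4, 0.5), (1.0, 2.0, 3.0), (-1.0, 0.5, 2.0), 3.0)
```

The first three components reproduce the three-vector force density, the fourth equals (1/c) times the rate of working of the electric field on the charge, and the contraction with the current vanishes.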


126. The momentum-energy tensor of the electromagnetic field. Consider a continuous medium formed by the aggregate of a large number of charged particles, electrons for example. If the number is sufficiently large the effects due to individual particles will be negligible. We therefore consider the total electromagnetic field acting on an individual particle to be given independently of the behaviour of the particle and represent it by $F_{\lambda\mu}$. The continuous medium is thus subject to the Lorentz force density given by (125.2).

It follows that the equations of motion of the continuous medium are given by (118.2) with the substitution $\Phi^\lambda = K^\lambda$:

$$\nabla_\mu P^{\lambda\mu} = \frac{1}{c^2}\,K^\lambda. \tag{126.1}$$

It is possible to put the right-hand side of this equation into a similar form to that of the left-hand side. Substituting the current vector $J^\lambda$ given by (122.5) into (125.2) we obtain

$$K_\lambda = -\frac{1}{4\pi}\,F_{\lambda\mu}\,\nabla_\rho F^{\mu\rho}.$$

Therefore

$$4\pi K_\lambda = -\nabla_\rho\left(F_{\lambda\mu}F^{\mu\rho}\right) + F^{\mu\rho}\,\nabla_\rho F_{\lambda\mu}.$$

The second term on the right-hand side gives

$$F^{\mu\rho}\,\nabla_\rho F_{\lambda\mu} = \tfrac{1}{2}\left(F^{\mu\rho}\,\nabla_\rho F_{\lambda\mu} + F^{\rho\mu}\,\nabla_\mu F_{\lambda\rho}\right) = \tfrac{1}{2}\,F^{\mu\rho}\left(\nabla_\rho F_{\lambda\mu} + \nabla_\mu F_{\rho\lambda}\right),$$

where we have interchanged repeated indices and used the antisymmetry of $F_{\lambda\mu}$. However, from the second set of Maxwell's equations in the form (123.7), we have

$$\nabla_\rho F_{\lambda\mu} + \nabla_\mu F_{\rho\lambda} + \nabla_\lambda F_{\mu\rho} = 0$$

for any set of indices $\lambda$, $\mu$, $\rho$. Hence

$$F^{\mu\rho}\,\nabla_\rho F_{\lambda\mu} = -\tfrac{1}{2}\,F^{\mu\rho}\,\nabla_\lambda F_{\mu\rho} = -\tfrac{1}{4}\,\nabla_\lambda\left(F^{\mu\rho}F_{\mu\rho}\right),$$

and we obtain the expression

$$-4\pi K_\lambda = \nabla_\rho\left(F_{\lambda\mu}F^{\mu\rho}\right) + \tfrac{1}{4}\,\nabla_\lambda\left(F_{\alpha\beta}F^{\alpha\beta}\right)$$

for the Lorentz force density. Relabelling the indices we have

$$-4\pi K_\lambda = \nabla_\mu\left(-F_{\lambda\rho}F^{\mu\rho} + \tfrac{1}{4}\,g_\lambda^{\ \mu}\,F_{\alpha\beta}F^{\alpha\beta}\right).$$

We are thus led to introduce the symmetric tensor

$$M_{\lambda\mu} = \frac{1}{4\pi c^2}\left(\tfrac{1}{4}\,g_{\lambda\mu}\,F_{\alpha\beta}F^{\alpha\beta} - F_{\lambda\rho}F_{\mu}^{\ \rho}\right), \tag{126.2}$$

and equations (126.1) take the simple form

$$\nabla_\mu\left(P^{\lambda\mu} + M^{\lambda\mu}\right) = 0. \tag{126.3}$$

Since the right-hand side is zero, $M^{\lambda\mu}$ accounts for the electromagnetic interactions and may therefore be added to $P^{\lambda\mu}$ to obtain the total momentum-energy tensor of the continuous medium. $M^{\lambda\mu}$, defined by (126.2), is known as the momentum-energy tensor of the electromagnetic field. In reduced Galilean coordinates the components of this tensor comprise the components of the Maxwell stress tensor, the components of the Poynting vector and the energy density of the electromagnetic field.
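The physical content of the components can be made concrete with a small numerical sketch. It assumes reduced Galilean coordinates with diagonal metric (-1, -1, -1, +1), list index 3 standing for x^4, and the tensor taken in the form M_{lambda mu} = (1/(4 pi c^2)) (g_{lambda mu} F_{alpha beta} F^{alpha beta} / 4 - F_{lambda rho} F_mu^rho). The overall factor 1/(4 pi c^2) is an assumption chosen so that M has the dimensions of a mass density, like the tensor P of the medium. Function names and sample values are hypothetical.

```python
import math

ETA = [-1.0, -1.0, -1.0, 1.0]  # diagonal metric; list index 3 stands for x^4 = ct

def covariant_field_tensor(E, H):
    """F_{lambda mu} with H_x = F_23, H_y = F_31, H_z = F_12, E_x = F_14, E_y = F_24, E_z = F_34."""
    Ex, Ey, Ez = E
    Hx, Hy, Hz = H
    entries = {(1, 2): Hx, (2, 0): Hy, (0, 1): Hz,
               (0, 3): Ex, (1, 3): Ey, (2, 3): Ez}
    F = [[0.0] * 4 for _ in range(4)]
    for (a, b), value in entries.items():
        F[a][b] = value
        F[b][a] = -value
    return F

def em_momentum_energy(E, H, c):
    """Contravariant components M^{lambda mu} of the electromagnetic momentum-energy tensor."""
    F = covariant_field_tensor(E, H)
    # F_{ab} F^{ab}: indices are raised with the diagonal metric.
    invariant = sum(ETA[a] * ETA[b] * F[a][b] ** 2 for a in range(4) for b in range(4))
    scale = 1.0 / (4.0 * math.pi * c**2)
    M = [[0.0] * 4 for _ in range(4)]
    for l in range(4):
        for m in range(4):
            metric = ETA[l] if l == m else 0.0
            mixed = sum(F[l][r] * ETA[r] * F[m][r] for r in range(4))  # F_{l r} F_m^r
            lower = scale * (0.25 * metric * invariant - mixed)
            M[l][m] = ETA[l] * ETA[m] * lower  # raise both indices
    return M

# Hypothetical sample fields in units where c = 1.
M = em_momentum_energy((1.0, 2.0, 3.0), (-1.0, 0.5, 2.0), 1.0)
```

Under these assumptions the 44 component equals (E^2 + H^2)/(8 pi c^2), the 4i components are proportional to the components of the vector product of E and H (the Poynting vector direction), and the contracted tensor vanishes.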
CHAPTER VIII


Elements of the Relativistic Theory 
of Gravitation 


127. Gravitation. The notion of gravitation is based essentially on the 
following experimental fact: all material bodies interact with one 
another at a distance. This interaction is described simply and pre- 
cisely in terms of attractive forces within the framework of classical 
mechanics. Such a description is, however, not in accord with the
relativistic requirements† previously introduced.

We shall refer to the law of motion of an infinitesimal test body, 
which is shielded from all electrical and contact interactions, as the 
basic law of gravitation. It is an experimental fact that, when the test 
body is not too far away from other material bodies, its motion differs 
appreciably from the uniform motion in a straight line which is 
implied by the principle of inertia. Consider the space-time manifold $V_4$ and suppose that it has the metric (111.1). We assume that the basic law of gravitation is directly linked with the form of $ds^2$ and is strictly determined by it. Then, as the Galilean $ds^2$ has constant coefficients, the basic law must be everywhere the same with respect to a Galilean frame. When the test body is far from other material bodies it satisfies the principle of inertia; this holds everywhere if the test body is completely isolated. Thus, according to the above hypothesis, a Galilean $ds^2$ is only capable of representing a universe devoid of matter, which is therefore also devoid of gravitation.

In order to represent a universe with gravitation, Einstein intro- 
duced metrics which were more general than the Minkowski metric 
(111.1) of special relativity theory. He considered Riemannian space- 
times, the metrics of which were supposed to determine the basic 
law of gravitation. 

How is it possible to envisage such a relationship? We have seen in §112 that, in a universe without gravitation, the geodesics of the line elements with positive $ds^2$ define the motion of a material test body and those for which $ds^2 = 0$ define the paths of light rays. We now extend this idea to the more general $ds^2$ introduced by Einstein. We postulate:

The geodesic principle: for any distribution of mass and energy, the geodesics of the line element of $V_4$ define the motions of material test bodies and the paths of light rays.

† For a discussion of this point see Chapter X of Introduction to the Theory of Relativity, by P. G. BERGMANN (Prentice-Hall, 1942). (T.)


128. The metric of general relativity. In Einstein's conception, the universe is represented by a four-dimensional Riemannian space $V_4$ which has the metric

$$ds^2 = g_{\lambda\mu}\,dy^\lambda\,dy^\mu \tag{128.1}$$

with signature $(+\,-\,-\,-)$. $ds$ is called the infinitesimal element of the interval in space-time. The physical interpretation of the coordinates and of the element of interval $ds$ must be made in terms of the tangential or osculating Euclidean metrics at a point. At any particular point it is possible to bring such a Euclidean metric into the simple Minkowski form (111.3).

The ten $g_{\lambda\mu}$, which correspond to a particular system of coordinates, are functions of the $y^\lambda$. They define completely the basic law of gravitation in terms of this system by the geodesic principle. For this reason they are called the gravitational potentials. The derivatives of these potentials, which appear in the geodesic equations as the Christoffel symbols, define the gravitational field in the coordinate system under consideration.

The essential problem in the general theory of relativity is the 
determination of the gravitational potentials which correspond to 
various states of matter in motion. 


129. Einstein's equations. Einstein was led to partial differential equations limiting the generality of the gravitational potentials by two essential requirements: on the one hand these equations must generalize the equations of Laplace and Poisson which govern the Newtonian potential; on the other hand they must be expressible in the form of relations between tensors in $V_4$. We assume that they may be written in the form of a relationship between two symmetric tensors

$$S_{\lambda\mu} = \chi\,\Theta_{\lambda\mu}, \tag{129.1}$$

where $\chi$ denotes a constant factor associated with a universal gravitational constant. The tensor $\Theta_{\lambda\mu}$ is taken to describe the energy distribution at each point (this being zero in matter-free regions). It therefore generalizes the right-hand side of Poisson's equation.

In order to take account of all forms of energy we identify $\Theta_{\lambda\mu}$ with the total momentum-energy tensor which was studied in §118 and §126. In the case of a continuous medium with electromagnetic interactions we therefore write

$$\Theta_{\lambda\mu} = P_{\lambda\mu} + M_{\lambda\mu}. \tag{129.2}$$


If we introduce an osculating Euclidean metric at a point in $V_4$ the results obtained in our study of special relativity become valid locally for the space-time of general relativity. In particular $\Theta_{\lambda\mu}$ satisfies equations (126.3) for the transfer of energy and momentum through a continuous medium, i.e.

$$\nabla_\mu\Theta^{\lambda\mu} = 0. \tag{129.3}$$

This equation may be interpreted by saying that the tensor $\Theta^{\lambda\mu}$ is conservative. It may be compared with (124.4) which represents the conservation of charge.

We assume the tensor $S_{\lambda\mu}$ appearing on the left-hand side of (129.1) to have a purely geometrical significance, which is to say that it only involves the metric (128.1) and its derivatives. A priori the ten $S_{\lambda\mu}$ cannot be independent. Disposing of the arbitrariness which exists in the coordinate system $(y^\lambda)$ determines the values of four of the potentials. The remaining six potentials would then have to satisfy ten independent conditions if the quantities $S_{\lambda\mu}$ were completely independent.

Since $\Theta_{\lambda\mu}$ is conservative (129.1) implies that

$$\nabla_\mu S^{\lambda\mu} = 0, \tag{129.4}$$

which gives the four relations linking the ten functions $S_{\lambda\mu}$. We also expect the field equations to be generalizations of the second-order differential equations of Laplace and Poisson. In fact, the following two conditions are sufficient to determine the geometrical tensor $S_{\lambda\mu}$:

(1) The quantities $S_{\lambda\mu}$ depend only upon the gravitational potentials and their first and second-order derivatives. They are linear with respect to the second-order derivatives.

(2) The quantities $S_{\lambda\mu}$ satisfy the conservation equations

$$\nabla_\mu S^{\lambda\mu} = 0.$$


From the considerations of §85 and §86 we are already aware of a tensor which satisfies the above conditions. This is given by

$$R_{\lambda\mu} - \tfrac{1}{2}\,g_{\lambda\mu}R,$$

where $R_{\lambda\mu}$ is the Ricci tensor and $R$ is the scalar Riemannian curvature of the Riemannian space (128.1). Élie Cartan has shown that the only tensors satisfying the above conditions are given by

$$S_{\lambda\mu} = h\left[R_{\lambda\mu} - \tfrac{1}{2}\,g_{\lambda\mu}(R + k)\right],$$

where $h$ and $k$ are constants. The corresponding partial differential equations may be written

$$R_{\lambda\mu} - \tfrac{1}{2}\,g_{\lambda\mu}(R + k) = \chi\,\Theta_{\lambda\mu},$$

if we suppress the superfluous factor $h$. Except in certain very special cosmological studies one is only concerned in general relativity with equations where $k$ is zero (which were first given by Einstein). Einstein's gravitational equations are therefore given by

$$R_{\lambda\mu} - \tfrac{1}{2}\,g_{\lambda\mu}R = \chi\,\Theta_{\lambda\mu}, \tag{129.5}$$

or, in the absence of gravitating matter,

$$R_{\lambda\mu} - \tfrac{1}{2}\,g_{\lambda\mu}R = 0. \tag{129.6}$$

Equations (129.6) are equivalent to

$$R_{\lambda\mu} = 0, \tag{129.7}$$

as these equations immediately imply that $R = 0$.
130. The momentum-energy tensor. The momentum-energy tensor $\Theta_{\lambda\mu}$ which appears on the right-hand side of the Einstein equations should describe completely the distribution of matter and energy in motion. It is necessary, as in Newtonian mechanics, to be content with an incomplete description which corresponds to a tensor $\Theta_{\lambda\mu}$ of restricted complexity. This tensor contains various terms which correspond to the different kinds of energy: rest-mass energy, kinetic energy, energy deriving from stresses, and from the electromagnetic field, etc. In the presence of matter the term which is experimentally the most important is that which corresponds to the rest-mass energy.

We note that, because of the non-uniformity of Riemannian space, 
one cannot suppose material bodies to be rigid as in classical 
mechanics. We are thus forced to consider models which are hydro- 
dynamical in origin; this is why the study of continuous media plays 
such an important role in the relativistic theory of gravitation. 

Many forms of the momentum-energy tensor are commonly used in relativistic gravitational theory. Consider first a continuous medium such that interactions in the interior of the medium are negligible and there are no electromagnetic interactions. The corresponding form of the momentum-energy tensor is given by

$$\Theta^{\lambda\mu} = \rho\,u^\lambda u^\mu, \tag{130.1}$$

where $\rho$ is the density of the matter at rest and where $u^\lambda$ is the unit velocity vector of the matter distribution at the point considered. The tensor (130.1) corresponds only to the rest-mass and kinetic energy.

If interactions defined by a stress tensor $T_{\lambda\mu}$ exist inside the medium we have

$$\Theta^{\lambda\mu} = \rho\,u^\lambda u^\mu + T^{\lambda\mu}. \tag{130.2}$$

In particular, in the case of a perfect fluid (118.3) gives

$$\Theta^{\lambda\mu} = \left(\rho + \frac{p}{c^2}\right) u^\lambda u^\mu - \frac{p}{c^2}\,g^{\lambda\mu}, \tag{130.3}$$

where $p$ denotes the pressure.

The lines in $V_4$ which are everywhere tangential to the unit velocity vector $u^\lambda$ are called the flow lines of the distribution of matter.

The electromagnetic field in the absence of matter gives rise to the momentum-energy tensor defined by (126.2).


131. Conservation equations inside matter. Let us apply the conservation equations

$$\nabla_\mu\Theta^{\lambda\mu} = 0 \tag{131.1}$$

to the momentum-energy tensor given by (130.1): it follows that

$$\nabla_\mu(\rho\,u^\mu)\,u^\lambda + \rho\,u^\mu\,\nabla_\mu u^\lambda = 0. \tag{131.2}$$

However, since $u^\lambda$ is a unit vector, we have

$$u_\lambda\,\nabla_\mu u^\lambda = 0.$$

Hence, after contracted multiplication by $u_\lambda$, (131.2) gives

$$\nabla_\mu(\rho\,u^\mu) = 0, \tag{131.3}$$

and

$$u^\mu\,\nabla_\mu u^\lambda = 0. \tag{131.4}$$

The geometrical interpretation of these equations is very simple: (131.3) states that the divergence of the vector $\rho\,u^\mu$, the generalized matter flux, is zero; this is the equation of continuity for the matter. (131.4) states that the flow lines are geodesics of $ds^2$, this result being directly related to the geodesic principle. Analogous but more complicated results have been given by Eisenhart and Lichnerowicz for the case in which there is an internal stress in the medium.

In view of the scope of this work we are forced to restrict the 
present discussion to these relatively elementary considerations, and 
to refer the reader who desires a more detailed account to the special- 
ized treatises listed in the bibliography. 